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Introduction to the English Edition 
by Dr. Steven Wiggins 


When faced with a new problem, an often effective strategy for solving it is to 
somehow transform it into the form of a problem previously solved. Obviously 
there are no general recipes for implementing such a procedure; instead one 
must usually rely on insight and luck. 

However, in the context of the “local” theory of ordinary differential equa- 
tions (“local” means in a neighborhood of some invariant manifold) such a 
procedure for reducing an ordinary differential equation to the “simplest possible 
form” (what this phrase means can be precisely defined and depends on the par- 
ticular problem) or normal form is available and goes by the name of the method 
of normal forms. 

The method of normal forms is not new and is usually attributed to Poincaré 
although some of the basic ideas of the method can be found in earlier works of 
Jacobi, Briot and Bouquet. Following Poincaré, the method was further devel- 
oped and utilized in the works of Lyapunov, Dulac, Birkhoff, Cherry, Siegel, 
Moser, Sternberg, Chen, and Gustavson. 

In this book, A. D. Bruno gives an account of the work of these mathemati- 
cians as well as the results of his own extensive investigations on the subject. The 
book begins with a thorough development of the analytical techniques necessary 
for the implementation of the theory as well as an extensive description of the 
geometry of the Newton polygon. He then proceeds to discuss the normal form 
of systems of ordinary differential equations giving many specific applications of 
the theory. An underlying theme of the book is the unifying nature of the method 
of normal forms regarding techniques for the study of the local properties of 
ordinary differential equations. Indeed, Bruno shows that many of the techniques 
used for studying ordinary differential equations having a small parameter (e.g. 
the method of averaging) may be viewed as a special case of the method of 
normal forms. In the second part of the book he shows, for a special class of 
equations, how the method of normal forms yields classical results of Lyapunov 
concerning families of periodic orbits in the neighborhood of equilibrium points 
of Hamiltonian systems as well as the more modern results concerning families of 
quasiperiodic orbits obtained by Kolmogorov, Arnold, and Moser. 

One of the more beneficial results of the recent explosion of interest in non- 
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linear dynamical systems is the increased interaction between mathematicians 
and applied scientists. Regarding the method of normal forms, this has led to 
new developments in the theory as well as a variety of applications of the method 
to bifurcation theory, fluid mechanics, and structural mechanics. The following 
references supplement those given by Bruno. 
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Basic Notation 


x = dx/dt 
X =(X,,-.-,X,) Y =(y1,---3 Vn) 
A = (Aq, .-+ 5 An) Q = (41, +--+ An) 

Scalars are denoted by small letters, vectors and matrices by capitals (only in 
Chapter V are capital letters used for scalars). Vectors are written as rows, but 
are considered to be columns in matrix multiplication. 

(Q, AD = GA, + adn + °° + Ann 
xX? = x1 %X_",..x, 
IQl=41 +42 +" +4, 
1Q] = |i] + laa] +++ + 14a! 
|x| = (xil, Ixal, eee IXnl) 
E,—the k'* unit vector. 
E™—the m x m identity matrix. 
V > 0 means that v, > 0,...,0, 20 
A—the complex conjugate of A. 
O(X) = x, F(X) = xi > fioX?, 
Fo = (fiehre: eas »Ing)s 

F=(f,,.-.sSp) = Fo X% 
R{—the space of exponents 
R3—the conjugate (dual) space to R{. Sets in these spaces are denoted by 

boldface Latin capitals (L, M, ...) and by the letters 7, 4. 

L—a linear subspace of R{ consisting of all real solutions Q of the equation 

<Q, A> = 0. 

O = {Q: Q > 0}—the non-negative orthant (quadrant, octant, ...) 
Z"— the integer lattice in Rj. 

D = D(S) = {Q: fo # 0}—the support of the series f =)" foX?. 
A—the convex huil of the set D. 

1 —the closure of 4. 

6f——the boundary of I. 

T;®—a face of the polyhedron I. 

f—Newton open polygon, a part of the boundary éF. 
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V{—the tangent cone to the face 5. 
Uf?—the normal cone to 55. 
U(e)—a set in R3, corresponding to the face J} and to the number « > 0. 
Rj or C§—the basic space with coordinates X, Y or Z,... 
Capital script letters denote sets in R§ (or C3) and classes of functions and 
power series: 
%—a neighborhood of the point X = 0, 
MU (e)—a part of %, corresponding to the face J;, 
v; —the class of power series, the supports of which lie in the set Vj. 
A tilde is placed below the capital Greek letters, Z, H, M, N, E to distinguish 
them from the Latin letters, Z, H, M, N, E, which are similar. All asterisks have 
the same meaning, regardless of type size. 


Introduction 


1. Classification of Equations 


In many cases, problems in mechanics, physics, and other natural sciences 
can be reduced to the solution of a system of equations 


FlX15 065%) =O, j=l,....m, (1) 
where the f; are polynomials in the variables, 
Sie i PS es > j=l,...,m, (2) 


or else are close to polynomials. Depending on the nature of the variables x; and 
coefficients a, different types of equations are obtained— types which are studied 
by different mathematical sciences. Generally speaking, the classification of these 
types takes the following form: 

I. The coefficients a are integers or rational numbers, as are the desired 
solutions x;. These are the problems of number theory. 

II. The coefficients a and solutions x; are real or complex numbers. These 
problems arise in algebra—specifically, in algebraic geometry and in the theory 
of analytic sets. 

IIE. The coefficients a are real or complex functions of some variable t, the 
unknowns are functions of this variable, x, = y,(f),-.., X, = y,(t) and derivatives 
of these functions, x; = d*y,/dt°(j > 1). The problem consists of seeking those 
functions y,(t),..., y,(t}) which satisfy system (1). Such problems are treated by 
the theory of ordinary differential equations. 

IV. The coefficients a are functions of several variables t,,..., t,, the unknowns 
are functions of these variables, x, = y,(t,,..., t,), ---. X1=ilty,.--, t,) and 
their derivatives: 

ey; 


x= j>olss=sytorts,. 
i~ arare...cte? ! 


The problem of finding functions y; which satisfy (1) belongs to the theory of 
partial differential equations. 
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Each type can be further divided into linear and non-linear problems. System 
(1) is called linear if each of the variables x; appears in the polynomials of 
(2) only in the first or the zeroth power; that is, if, in each term of the polynomial 
of (2), the q, are either zero or one and their sum is not more than one. If this is 
not the case, then system (1) is non-linear. Linear problems are normally much 
easier to solve than non-linear ones. 

Further, we shall distinguish between local and global problems. A local 
problem consists of investigating those solutions X of system (1) which lie in 
an arbitrarily small neighborhood of some known solution X°, for one or another 
definition of neighborhood. It is important that the size of the neighborhood not 
be essential; it is enough to find all solutions in some neighborhood of the 
solution X°. A global problem, on the other hand, consists of seeking all solutions 
to system (1) which belong to some fixed set—for example, for a problem of 
the second type finding all complex solutions X. Global problems are usually 
more difficult than local problems. 

To date some general approaches to non-linear problems have been developed, 
particularly in the local case. Non-linear problems are identified as being either 
quasi-linear or essentially non-linear. Quasi-linear problems are solved either 
by taking into account nonlinear “perturbations” (the classical approach) or 
through a change of coordinates which simplifies (1) and transforms it into 
a so-called “normal form” (this is the modern approach). In essentially non-linear 
problems, one must know how to choose a first approximation to system (1) 
(Le., a truncation) that leads to linear inequalities in the exponents q,,..., q, of 
expression (2). These inequalities are solved by means of a geometrical construc- 
tion (“Newton’s polyhedron”). By means of power transformations, non-linear 
in X but linear with respect to the qg; and In x;, one can succeed in simplifying 
the truncation. By this means, one can reduce an essentially non-linear problem 
to several quasi-linear problems. 

Remarkably, essentially non-linear problems are thus closely connected with 
certain linear geometry in the “space of the exponents.” 


2. Fundamental Ideas of the Local Method 


We will explain these ideas by considering those solutions of a system of 
ordinary differential equations 


dX/dt = ®(X) (3) 


which lie in a neighborhood ¥ of the stationary solution X = (x,,...,x,) = 0. 
We consider in parallel the cases of real and complex X and ©. Here, the function 
(x) is assumed to be analytic or sufficiently smooth in the neighborhood &%. 
We make no further restrictions: we even consider cases of degeneracy and of 
resonance. 

We shall solve the problem posed above by constructing a special local 
coordinate transformation which will change (3) into an integrable system. Such 
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a transformation exists for the entire neighborhood only in comparatively simple 
cases. In complicated cases, the neighborhood being investigated must be divided 
up into parts, each with its own local coordinates with respect to which the 
system is integrable. Generally speaking, the construction of these coordinate 
changes and decompositions is accomplished step by step, and not all at once. 
Ateach step we construct a finer decomposition, and in each part of the neighbor- 
hood we introduce a coordinate system within which the system is simpler. 
Working under very general assumptions, we can arrive after a finite number 
of steps at a system which is integrable in each part of the neighborhood. 

In systems arising in mechanical, physical, and astronomical problems it is 
usually sufficient to go through one (occasionally two) such steps. The foundation 
of this local method rests on two basic ideas: 

1. The normal form. If system (3) has a non-zero linear part, then we can 
transform it into a normal form, which easily reduces to a system of lesser order 
with a vanishing linear part, and is often immediately integrable. Various cases 
of such a normalizing transformation have been considered by Poincare, Picard, 
Horn, Dulac, Birkhoff, Cherry, Siegel, Moser, Sternberg, and others. A definitive 
form, a method of reduction of order and general properties of the normal form 
have been described by the author [Bruno 1964-1976]. 

2. The reduction of complex singularities. If the expansion of the vector- 
function @(X) in powers of the x; contains no linear terms, then we may select 
a finite number of “truncated systems” 


dX/dt = B(X) (4) 


and correspondingly divide the neighborhood ¥ into parts &{(c) such that 
each part %{(¢) is a curvilinear cone touching the stationary point, while the 
truncation (4) is the first non-trivial approximation to system (3) in %(e). 

This is effected by the construction of a certain polyhedron in the exponent 
space. Further, for a simple (in some sense) truncation (4), we can construct 
a normal form of system (3) in the part MU? (e)—that is, we can simplify system (3) 
in that region. For a more complicated truncation (4) we must make a birational 
(power) transformation, blowing up the stationary point into a manifold .@ and 
the region %(¢) into a neighborhood @’ of this manifold. It is then necessary 
to find the stationary points in .#% and to study their neighborhoods, which 
will be parts of the neighborhood %’. In each of these parts the system will be 
in some sense simpler than the initial system, and we can employ anew our 
simplifying and reducing constructions. This method of reducing singularities is 
analogous to the “multiple sigma-process” of algebraic geometry and has its 
origin in a Newton polygon. For a two-dimensional system of differential equa- 
tions (3), various truncated systems (4) have been selected and studied by Briot 
and Bouquet [Briot, Bouquet, 1856], Horn, Frommer, and others. This has been 
done for multi-dimensional systems by the author [Bruno, 1962, 1965]. 

The local method, naturally, has two parts. The first—the algebraic part— 
consists of finding an algorithm for constructing the required formal power 
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series. The second part consists of interpreting these series by means of analytic 
functions (Poincaré, Dulac, Siegel, Pliss, and the author [Bruno, 1971]) or 
smooth functions (Birkhoff, Sternberg, Hartmann, Grobman, and others), or in 
estimating the accuracy of approximate integration by means of these methods 
(Birkhoff, Siegel, Moser, and others). 


3. The Contents of this Monograph 


The main objects of study of the present work are local non-linear problems 
of the second and third types, though their study sometimes reduces to that of 
linear problems of the first or fourth types. In addition, our attention is primarily 
directed towards analytic systems (1), where the f; are analytic functions of their 
arguments. 

Chapters I and II consider two-dimensional systems in great detail. Chapters 
III and IV treat multi-dimensional systems in slightly less detail. Chapter V 
applies the local method to concrete mechanical problems. 

§1 of Chapter I is devoted to the solution of linear inequalities, and in 
connection with this discusses geometry in the plane R? and the dual plane R3. 
The techniques developed there will be the basis of the “geometry of exponents”. 
§ 2 of Chapter I presents methods of solving an analytic equation 


f(%1,%2) =90 


in the neighborhood of the critical point x, = x, = 0.§3 applies the local method 
to the study of the level curve f(x,,x,) = c in the neighborhood of the critical 
point. This is the simplest problem with which we can demonstrate the whole 
apparatus of the local method. 

Chapter IT treats the system of differential equations 


ax, /dt = 1(X1,X2) ? 
dx,/dt = 2(x1,X2) ? 


in the neighborhood of a singular point using the local method. 

Chapter III systematically presents the theory of the normal form of the 
system of differential equations (3) in a neighborhood of the elementary singular 
point X = 0. §4 of the chapter extends this theory to a neighborhood of an 
invariant manifold. 

§1 of Chapter IV treats the application of the local method to the solution 
of system (3) in the neighborhood of a non-elementary singular point. §2 gives 
a review of recent investigations using the geometry of exponents (Newton’s 
polygon or polyhedron). 

§ 1 of Chapter V considers the problem of the influence of nutational oscilla- 
tions on the rate of precession of a heavy gyroscope in a Cardan suspension. 
This example illustrates the ideas of §§ 1, Chapters II and HT. §2 of Chapter V 
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treats the problem of the existence of periodic oscillations of a satellite in the 
plane of an elliptical orbit; this illustrates the methods of §§3 and 4 of 
Chapter IIT. 

The treatment of Chapters I, II, and V and §§1, 2 of Chapter III is detailed 
(especially Chapters I and II). The last half of Chapter III and all of Chapter IV, 
in contrast, are more review and problem oriented. Chapters I and II are based 
on lectures given by the author [Bruno, 1973a] and are written in the form ofa 
text book. These chapters contain detailed proofs, many figures, examples, and 
exercises. They should be understandable to students who have completed two 
years of university study. Chapters III and IV are based on work by the author 
[Bruno, 1971, 1972a] and a series of his articles. The presentation is such that it 
should be possible to read each chapter independently. For a short outline 
of the local method in the author’s own translation see the author’s lectures 
(Bruno, 1987]. 

A recent development of the local method can be found in the following works 
by the author: 


On stability in a Hamiltonian system. Mat. Zametki, 40, No. 3, 385-392 (1986) [Russian]. Math. 
Notes, 40, No. 3, 726-730 (1986) [English] 

Bifurcations of periodic solutions in a symmetrical case of a multiple pair of imaginary eigen- 
values. In: The Numerical Solution of Ordinary Differential Equations. Akad Nauk SSSR Inst. Prikl. 
Mat., 161-176, 239 (1988) [Russian] 

The normal form of a Hamiltonian system. Usp. Mat. Nauk, 43, No. 1, 23-56 (1988) [Russian]. 
Russ. Math. Surv., 43, No. 1 (1988) [English] 

The normalization of a Hamiltonian system near a cycle or a torus. Usp. Mat. Nauk, 44, No. I 
(1989) [Russian]. Russ. Math. Surv., 44, No. 1 (1989) [English] 

On small divisors. Banach Center Publications. Warsawa: PWN, v. 23 (1989) [English] 

On the question of stability in a Hamiltonian system. Ibid. 


[Translator’s note: the author’s name is often spelled “Bryuno” in translations, 
but “Bruno” is the spelling used throughout this monograph. ] 


Chapter I 
Foundations of the Local Method 


§1. Linear Inequalities in the Plane 


1.1. Basic Definitions 


By R? we will denote a real two-dimensional vector space. That is, R? is 
the real plane, with Cartesian coordinates q, and q,. The points of this plane 
are two-dimensional vectors Q = (q,,q2). If p,, p, and c are fixed real numbers, 
then the solutions of the equation 


41P1 + 42P2 =C€ (1) 


lie along a straight line in R? perpendicular to the vector P = (p,, p2). 

Now let us vary the magnitude of c and consider the lines defined by (1) for 
different values of c. These will be mutually parallel lines. If c = 0, the line will 
pass through the origin, Q = 0; if c > 0, the line will be displaced in the direction 
of P, and the displacement will increase with increasing c; finally, when c < 0, 
the line will be displaced in the opposite direction (figure 1). Each of these 
lines divides the plane into two half-planes: the positive half-plane, in which 
41P1 + 92P2 > c, and the negative, in which q;p, + q2p2 <c. Note that if we use 
the scalar product <Q, P> = q, Pp, + q2p2, then equation (1) can be written in 
the form 


Q,P>=c. 


From now on we will consider the vectors P and Q, the two factors in this scalar 
product, to be points in two distinct planes: 


QeR?, PeR3. 


The reason for this separation will become clear as we proceed. 

Let 4 be a point set in R?; it is convex if, along with any two of its points Q, 
and Q,, the set contains the entire line segment joining the two points. The set 
A is called the convex hull of a set D if 4 is the smallest convex set containing D. 
For example, the convex hull of two points Q, and Q, is the line segment which 
joins them; the convex hull of three points is the triangle with its vertices at those 
three points. This fact can be expressed by the formula 
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Fig. 1 


4= {0:0 = 46,0, + 6,9, + 6303; 
6, 6, 6, = 0; 5, + 6, + 63 = 1; 01, Q2,03€D} . 


If D is a finite set of points, then its convex hull 4 will be a closed polygon 
whose vertices are points in D. If a set D contains infinitely many points, its 
convex hull may not be closed, and may not be a polygon. For instance, let 


D = {Q: q, = 1/m, q, =0;m = 1,2,...}, (2) 


then 4 is the half-open interval (0, 1] on the g, axis. A second example: the convex 
hull of a circle is a disk. 

Let the set D < R? and the vector P € R3 be fixed, and let c = sup<Q, P) 
for all Q € D. Then the equation 


{Q,P> = 


determines in the plane R? the support line Lp of the set D with respect to the 
vector P. The inequality 


<Q,P> <e 


defines the corresponding supporting half-plane LS”. The half-plane <Q, P) > c 
contains no points of D; however, any half-plane determined by 


<0,P>>c-é, e>0O 


does contain points of the set D. If D consists of a finite number of points Q, 
then the supremum of the scalar products <Q, P» is equal to the maximum, and 
some points of D lie on the support line Lp. The support line for an infinite 
set D may not contain points of D (for example, the set (2), where P = (— 1,0)). 
The intersection 

DoaL, = D, 
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consists of all points of the set D for which the scalar product <Q, P> has its 
maximum value. 

By J let us denote the intersection of all the supporting half-planes of D; 
that is, 

= () Lf forall PER}. 
P£0 
Clearly, Fis a closed convex set which contains the convex hull 4 of D: 
rFoApmD. 


As long as Fis a closed set, it contains the closure of 4. 


Theorem 1. For every set D, the intersection I of all its supporting half-planes 
coincides with the closure of the convex hull A. 


For the proof, see § 1 of Busemann [1958]. 

We call the intersection of the set J with a support line L, a face. The faces 
of the set J lie on its boundary 07; conversely, the boundary Of is composed of 
the faces of I. If is a polygon, then its faces are edges and vertices. We will 
denote the faces of I” by J;, where d is the dimension of the object (i.e. d = 0 
for a vertex and d = 1 for an edge) and j is its number. On each face J; we 
identify a subset of D, 

bY = Ge AnD : 


consisting of those points of D which lie on the face J;. We will call the face 
1; proper if it has the same dimension as the convex hull of D{®; otherwise, 
we call J; improper. 


Example 1. D = {0;,Q2,Q3}, where Q, = (1,0), Q2 = (0,1), and Q; = (1,1) 
(figure 2). The support line for the vector P, = (— 1, — 1) is given by the equation 


<Q,,P,> = —1 (since (Q,, P)> = (Q2,P,> = —1,<Q3,P1> = —2 < —1). Thus, 


. 23 


0 @ 


Fig, 2 
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the supporting half-plane Lf) is defined by the inequality —q, — q2,< —1. 
Similarly if P, = (— 1,0), the support line Lp, is given by the equation q, = Oand 
the supporting half-plane Lf, by the inequality —q, < 0. Finally, the support 
line Lp, and the supporting half-plane Lp,’ for P; = (1,1) are gq, + q2 = 2 and 
41 + 42 < 2, respectively. The polygon J is the triangle with vertices Q,,Q>, Qs. 
Thus 

H=0,,  7=1,2,3; 


the edges Jj") are the line segments connecting Q, with Q,,,. Correspond- 
ingly, 
Di? = Q;, J=1,2,3; 


Di? = {21,22} ’ Di) = {Q2,Q3} > DS? = {23,01} : 
Here, all faces are proper. 
Example 2. D is the interior of the unit circle: 
D={Q:gi +43 <1}. 
Then 4 = D, and J is the closed unit disk: 
r=(O:g+a< 1}. 


The boundary éF of I is the unit circle gq? + q} = 1. Each of its points is a face 
T;. All Df are empty (the dimension is — 1); therefore all faces are improper. 

Note that an edge J;"” will be either a line (thus unique), a half-line (of which 
there can be no more than two), or a line segment. 


1.2. Cones 


A set K < R? is called a cone (see Goldman and Tucker, 1956) if, along with 
the point P, it contains the ray cP (for all c > 0). Thus, the cones in R? are: the 
origin P = 0, any ray, any sector, and the whole plane. A cone K is convex if it 
is a convex set. Thus, a sector is a convex cone only if its interior angle is no 
larger than zx. 

Let Lp be the support line of a set D for a vector P; then for all c > 0, 
L,p = Lp. That is, all vectors P’ for which Lp is the support line lie along the ray 
P = {cP,c > 0}. 

Let us denote by U the set of all vectors P ¢ R32 for which there exists 
a supporting half-plane L{” for the set D. 


Theorem 2. The set U is a convex cone. 


Proof. Let two vectors P, and P, lie in the set U. We will show that U 
contains every vector P = a, P, + «,P,, where a,, #, 20. The set I has two 
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supporting half-planes, Lp,’ and L{,); consequently, it is contained in their 
intersection: 
-) ce 
PoLpaLy. (3) 


If P, and P, are not parallel, the support lines Lp, and Lp, divide the plane 
Rj into four parts; one of these is LE) 0 Lf,. This set has support lines for all 
vectors P lying between the rays {cP\} and {cP,}. By virtue of (3), [ also has 
support lines for all such P. 

If P, and P, are parallel, on the other hand, then they lie either along the 
same ray or else along the same line. Either of these sets is convex. The theorem 
is proved. 1] 


We will call the set U the normal cone of the set D (or of the set I’). 

Let J; be a face of some closed convex set J. The set U{ of all vectors 
P € R3, for which the support line Lp intersects I precisely at the face T® is 
called the normal cone of the face I“: 


US = {P: FY =Lear} . 


It is easily seen that U is a convex cone. The union of all the normal cones U® 
of the faces J; is the normal cone U of the whole set 7. If 5° = Leal and 
Oe Lp, then 
{<Q,P>=<O,P> forallQe ri, 
<O,P><<O,P> forallQer\G5® . 


Therefore, the vectors in the normal cone U™ are determined by the system of 
equations and inequalities (4). The first of these conditions implies that the line 
<Q, P> = <0, P> contains the face 1; the second implies that this line is the 
support line to the set /. But then it must be the support line to the set D, so 
that the second condition of (4) can be rewritten as <Q,P) < <O, P) for all 
QeD\D". 

If the boundary subset D/® is non-empty, we can take 0 € D{. Finally, if 4) 
is a proper face, the first condition of (4) need only be applied to the points of 
D. Thus, for some point 0 € Di” 


Uo = {r: <Q,P> =<6,P)> for all Qe D®, 
* ~t"<0,P><<G,P> for all Qe D\D!” 


(4) 


(5) 


Example 3 (continuation of example 1). D = {{Q,,Q2,03};Q1 =(1,0),Q, = 
(0, 1), Q3 = (1, 1)} (figure 2). Here, the cone U will be the entire plane R3. For 
the edge I"), we have (in agreement with (5)) the normal cone 
i: (Q1,P> = <Q2,P>.) 
UP 0s, Pd < OnPYS” 


Taking into account the numerical values of the Q,, we have 


(ty 
U, 
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Fig. 3 


US? = {P: py = P2.Pi + P2 < Di} 
or 
UW? = {P: py = p, <0}. 


Thus, the normal cone U\’) is the ray in the p,, p, plane which bisects the third 
quadrant (figure 3). This ray is perpendicular to edge [\") and is directed out of 
the polygon I. Similarly we find the normal cone 


_ Sp. <Q2P> = <Qs,P>| 
de (Oi, P)<<On,P4 


= {P:p, =0,p, > 0}. 


UW 


In the p,, p. plane the cone UY? is a ray orthogonal to the edge I” 
(figures 2 and 3). Finally, 


uw = Sp. <Q3,P> = <Q1,P>| 
oP 9. Py <O,,PYf 


={P:p,>0,p,=0} . 


This is also a ray (figure 3). We will now find the cones of the vertexes. We first 
note that the vertex J; contains a single point of the set D, so only the 
inequalities in (5) are meaningful. Thus, we have the normal cone 


Uf? = (P: <Q2,P> < (Q1,P>;<Q3,P> < (Q1,P>} - 
Taking into account the coordinates of the points Q;, we obtain 
UP?) = {(P: pr < Pi,Pi + P2 < Pi} = {P: 2 < PisP2 <0} - 


The normal cone U is thus the sector of the p,, p, plane bounded by the rays 
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U and US (figure 3). Similarly, we have the normal cone 
US) = {P: (Qi, P> < <Q2, P>, (Qs, P> < (Q2, P>} 
= {Pi p, < p2,p, <0}. 
This is the sector of the p,, p, plane bounded by the rays U{? and U%})”. Finally, 
U® = {P: (Q,,P> < (Qs, P), <Q2,P> < (Qs, P)} = {P > 0} , 
the sector bounded by the rays UY? and U" (figure 3). 


Example 4 (continuation of example 2). Again, U is the plane R3. 

Every point O of the unit circle (¢? + G3 = 1) is an improper face. Only one 
support line passes through such a point—the line tangent to the circle with 
the normal vector P = 0. Consequently, the normal cone of any face O is the 


ray {cQ,c > 0}. 


Example 5. Let D be the set of points in the plane R? with non-negative, 
integral coordinates. Then I is the first quadrant, 7 = {Q: Q > 0}. Its boundary 
oF consists of one vertex and two edges: 


P=}. TM ={Q:a=0<a}, TP ={0:4, =0<q}. 
The normal cone for the vertex I) is 
U® = {P:<0,P> <0 forall Qe FO\E} 
Choosing for Q the values (1,0) and (0, 1), we get 
U® = {P: p, <0,p, <0} . 
The normal cones to the edges are 
UW = {P = (0, p2): pz < 0}, UW = {P = (p,,0): p, < 0}. 
The cone U = U® UUW U UW is the third quadrant of the plane R3: U = 


{P <0}. 


1.3. Degenerate Cases 


It might happen that all the points of a set D are collinear. In this case, the 
set I lies entirely on this line and is a line-segment, a half-line, or the line itself. 
It has a single edge, 7 = I, and the number of vertices does not exceed two. 


Example 6. D = {Q,,02,Q3},Q1 = (2,0), Q2 = (0,2), Q3 = (1, 1) (figure 4). 7 
is the line-segment with vertices [7° = Q, and F{° = Q,. There are three sets 
Di: Df? = Q,, DY? = Q,, and D{!’ = D. In constructing the normal cones, we 


1.3. Degenerate Cases 21 


Fig. 5 


find that there is one less inequality than in the non-degenerate case. Thus, we 
have (figure 5): 


Us? = (P: CE, P> = (12°), P>} = {P: Pi = Po} - 

This is the whole line p, = p,—not a half-line as in the non-degenerate case. 
UM = {Ps CIS, Pd < CO, Py} = {Ps p< py} ; 
UP = {P: CF”, P> < (Tf, Pd} = {P: p, < p2} . 


Here, U = Ut!) U U® U U is the entire R3 plane. 

If the set D consists of a single point Q,, then J’ degenerates to this single 
point. Thus, © = 7 = 0, = D = D®. The normal cone U® is the plane R3, 
as no conditions are placed on the vectors P. 

Thus, every set D < R? has a corresponding set I which is the intersection 
of all the supporting half-planes of D. J, in turn, has faces J}, and for each 5% 
there is a corresponding boundary subset D{ = J{@ 4D and a normal cone 
U. The order of these calculations may be written symbolically: 


D+F {19} + {Dj} + {UP} 
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1.4. Polygons 


We now consider the situation in which the set D consists of a finite number 
of points Q,,..., Q,,- Then, the convex hull 4 of D, is closed and coincides with 
the set / (Theorem 1). Now, J is a polygon whose vertices are necessarily points 
of the set D. Any remaining points of D will lie either on the edges of I’ or 
inside the polygon. It is also possible that an edge contains no points of D other 
than the vertices (see example 1). 

The polygon / is the convex hull of a finite number of points and has a finite 
number of edges and vertices; these are the /;. In principle, these could be 
arbitrarily indexed, but it is more convenient to proceed as follows. We choose 
a vertex and an orientation (clockwise, say). We designate the chosen vertex 1), 
the next vertex in the chosen direction is /{°, then J{°, and so forth. Finally, 
we will come to some vertex J; which is followed by the original vertex 1°). 
Then we can number the edges by letting edge Jj" connect J; and Jj) fof 
j <n, and connecting ;° and 7° with edge /;"). Thus, in example 1 we let 
T° = Q, and numbered the edges as in figure 2. 

The vertices J; play a special role in the polygon I. First, J is the convex 
hull of all its own vertices; second, each edge J;'” is the convex hull of its two 
vertices 5 and 159). 

The cone U is, in this case, the entire R3 plane. The normal cones Ul to 
the faces J; are given by the formulas of (5). In example 3, the normal cones 
were defined quite simply, thanks to the simplicity of the set D (which consisted 
of just three points). Generally, the number of equations and inequalities defining 
the normal cones can be reduced by using just those points of the set D which 
are vertices J; of the polygon 7. Then 


Sine CF, P> = cy, P); alas FOe eT) 


UW =} , 
ak (T, PY < <5, P), 7,0 ern” | 


Up) = (PEO, Pd < TG, PY # GO} 


Furthermore, in determining the normal cone U!” of an edge J; it is sufficient 
to use just one inequality, since the equation defines a line perpendicular to the 
edge Jj", and each inequality reduces to the same half-line, directed out of 
the polygon 7. 

In the determination of the normal cones U!® it is sufficient to use just two 
inequalities: 


Ul = (P: GY, Pd < (, P>, <5), P> < <5, Pd} , 


that is, the normal cone of the vertex J} is bounded by the normals to the sides 
GQ) and J; , which adjoin the vertex J; This follows from the fact that 
the inequalities defining the cone Uf can be written as (7; — 5, P> <0. 
All of these can be derived from the inequalities <j) — 1j,P> <0 and 
GY — Lj, P> <0, since the vectors 1; — 7; lie within the angle formed by 
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Fig. 7 


the vectors 1} — F{ and 13) — 5. That is, HO — FO = ¢,(G9 — GF) + 
c,(GP) — 5), where c,, c, > 0. 


Example 7. D = {Q,,Q2,03,Q4}, where Q, = (3,0), 0, = (0,3), Q3 = (1, 1), 
Q, = (1,2) (figure 6). The points Q,, Q2, Q3, are the vertices of a triangle I. 
We let 1° = Q,, f° = Q3, and T° = Q,. Then 1" is the edge connecting Q, 
and Q,, the edge I”) connects Q, and Q,, and the edge /{") connects Q, and 
Q, (figure 6). Further the sets D{® are 


DY =, , DY =Q; , DY =Q, , 
Di? = {0,,Q3} , Di) = {03,02} , Di? = {Q2,04,Q:} ; 
We next find the normal cones U!”. We have (figure 7): 
UP = (P: CY”, P> = CTY, P> > (TY, P>} = {P: pz = 2p, < 0} ; 
UW = {P: (79%, P> = (Tf), P> > TY”, P>} = {P: 2p, = p, < 0} ; 


Us) = (P: 19, P> = CI"), P) > <1, PD} = {PP = 2 > 0} 5 
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UY) = (P: CF, P) < CF, PY, LE, P) <<, PY} 
= {P: po < Pi,P2 < 2p} ; 
UD) = {P: TY”, P> < (1%, Pd, I$, P> < (IY, P>} 
= {P: 2p, < p2,2p. < p,} ; 
UD) = {P: IY, Pd < CTY), P>, "P< (1, Py} 


= {P: p, <2p2,p2.> Pi} - 


1.5. Subsets of the Integral Lattice 


The collection of all integral vectors Q = (q,,q2) is called the integral lattice 
and denoted by Z?. We now consider the case in which D ¢ Z? and indicate 
some properties of the sets J, U, Jj, and UM. 

1. / has no more than a countable set of faces J}. 

2. All the vertices J; are in D, and two edges extend from each vertex. 

3. Each edge Jj", which is a line segment, belongs to 4, the convex hull of D, 
and each is the convex hull of its own vertices. 

4. The cones U™ of all the vertices and edges, which are line segments, can 
be found by formula (5). 

We call the set 


K = {P: P = B,P, + B,P.;P,éD,B, > 0,i = 12} . 


the convex conic hull of the set D < R*. We call a convex cone rational if it is the 
convex conic hull of integral vectors. For example, the first quadrant {P > 0} is 
a rational cone, since it is the convex conic hull of the vectors (1,0) and (0, 1). 

The normal cone U is the union of the cones U{®, of which there may be 
a countable number. 


Theorem 3. If D < Z?, then each rational cone K < U intersects only a finite 
number of cones U. 


We omit the proof. 

If the set D lies in the first quadrant {Q > 0} of Rj, the normal cone U of D 
contains the third quadrant {P < 0} of R3. Therefore, there will only be a finite 
number of cones U™ in the third quadrant of RZ. Consequently, the set J” has 
only a finite number of faces 5 which lie on support lines Lp with P < 0. 
There are two methods of finding these faces. 

First, it is possible to find all the faces 5 and their cones U™, and then to 
select those cones which intersect the third quadrant of the plane R3. 

Second, we can begin by finding the vertical and horizontal support lines of 
D with the normal vectors P* = (— 1,0) and P, = (0, — 1), respectively. These 
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vectors span the rays P* = (—c,0) and P,, = (0, —c) (where c > 0), which in turn 
form the boundaries of the set {P < 0}. Let q,, = ming, for Q € D. Then the 
line q2 = 42, Will be the horizontal support line. Further, let g,, = ming, for 
Q = (41,424) € D. 

The point Q, = (414:42,) belongs to the set D, and is the left-most point of 
this set on the horizontal support line q, = q2,. We denote this point by 7, 
and pass through it that slanting support line Lp (P < 0) which contains one 
more point of D. We then select the leftmost point of D on this line, denote it 
by [{°, and pass through this point a new support line which contains yet 
another point of D but does not contain [{°. We denote the leftmost point of 
D on this new line by £{°, and proceed as above. This procedure is continued 
until we come to a point Q* € D lying on the vertical support line below the 
remaining points of D; that is, Q* = (q*, q3), where qt = min q, for all Q «¢ D and 
qt = min q, for all Q = (qt, q,) € D. As a vertex Q* must be indexed: J; = Q*. 
A convex polygonal line /' is contained between vertices { and J;; this open 
polygon consists of the vertices J} and the edges connecting them, J”). All of 
these are vertices and edges of the polygon J, but not all the vertices and edges 
of the polygon F are in F. In the situation of example 7 we have 


0,=%=1, Q*=0,=T%. 


The open polygon F consists of the three vertices 1, F{, © and the two 
edges, J") and I" (figure 6). 
Thus we have 


Theorem 4. If D < Z? and lies in the first quadrant {Q > 0}, then the number 
of the boundary subsets Dp, corresponding to vectors P < 0, is finite. 


The intersection J of the supporting half-planes of D is called Newton’s 
polygon, the part I of its boundary is Newton’s open polygon. 

In earlier times the words “Newton’s polygon” denoted the polygonal line F, 
a part of the boundary of the convex hull I of a set D. More recently, “Newton’s 
polygon” has come to mean the convex hull itself—i.e., a two-dimensional set. 
We adhere to the more recent terminology, and use the term “Newton’s open 
polygon” for f°. 

Newton’s open polygon is employed in finding the solutions, near the origin 
xX; = xX, = 0, of the equation 


Daa kt XE =0 2 


where the exponents q, and q, are integral and non-negative (see below, § 2). 
Newton himself [Newton, 1937] constructed only the lower right edge of the 
open polygon, the edge abutting the point Q,, besides he used q, as abscissa and 
q, as ordinate. In Chapter 4 of Walker’s book [Walker, 1950] Newton’s polygon 
is described, though somewhat differently from our description (see also the 
review article by Chebotarev [1943]). 
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The aim of our geometrical constructions is the solution of the following 
problem: Given a set of two dimensional vectors, D; for every vector P #0 
(or P < 0), choose a boundary subset Dp for which 


<Q;, P> = <Q;, P> > Q;,0;€ Dp , 
<O,,P> < <Q;,P> ; Q,€ D\D, . 


It was shown above that solutions to this problem are the subsets D?, which 
lie on the faces J; of the Newton’s open polygon f of D. There are, however, 
other geometric methods of solving this problem. We will consider them below, 
assuming for simplicity that D consists of a finite number of points Q,,..., Qn- 
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Given a set D, let us find its boundary subsets Dp. To do this, in the s,, s, 
plane we draw a line Q; corresponding to each Q; € D; the line Q, is given by 
the equation 


(QS) = 1. 


We next take a fixed vector P and consider the line P = {cP}, where ¢ is an 
arbitrary real number. We designate by S, the point of intersection of P and 
the line Q,, ie., 
S,=¢P, <Q,S8)>=1. 
Therefore, 
<Q;,P> = cj" 
and 
_ _, _ f(mine,)* fore, > 0, ifsomec; 20 , 
a ae eae a ee , ifallc, <0. 
We divide the line P into two rays: P™ = {cP,c > 0} and P™ = {cP,c < 0}. 
If the ray P™ intersects the lines Q,, then max(1/c,) occurs at that point S; on 
the ray which is nearest the origin S, = S, = 0. If the ray P‘*) does not intersect 
the lines Q, then max(1/c;) is attained at that point S, of the ray P which is 
farthest from the origin. Thus, for every vector P, the set D, consists of those 
vectors Q; whose corresponding lines Q, intersect the ray P‘*? nearest the origin. 
If no such lines Q, exist, then the subset Dp is composed of those Q; for which 
the lines Q; intersect the ray P at the farthest point from the origin. Let us now 
consider all possible rays P“ and P™. Those points of intersection of the rays 
P) with the lines Q; which are nearest the origin make up a convex open 
polygon Q, while those intersections of the rays P“ with the lines Q;, which 
are furthest from the origin, form a convex open polygon Q‘). These open 
polygons are convex towards each other. A boundary subset Dp which consists 
of a single point Q, corresponds to an edge of the open polygon 2 = QV. QO). 
A vertex of 2 corresponds to the subset Dp consisting of all points Q, for which 
the lines Q; pass through this vertex. 
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Fig. 8 


The open polygons 6/ and 2 are dual to each other, and lie in the dual planes 
R? and R3. The s,, s plane considered above is R3, the plane for which we earlier 
used the coordinates p,, p,. (New coordinates were introduced for convenience.) 
There exists a one-to-one correspondence between the vertices and edges of dF 
and @: each vertex of 6f corresponds to an edge of 2, and each vertex of Q 
corresponds to an edge of oF. 


Example 8 (Compare example 7). Let D = {Q,,Q,03,04}, where Q, = (3,0), 
Q, = (0,3), @, = (1, 1), and Q, = (1, 2). Then (see figure 8) we have the lines 


Q, = {S:s, =4} , Q, = {S:s, =4} , 
Q,={S:s,+5,=]}, Q, = {S:s, + 2s, = 1}. 


The open polygon 2” (the heavy lines in figure 8) consists of three sides and 
two vertices (compare figures 6 and 7); the open polygon 2‘? consists of two 
sides and one vertex (the heavy dashed lines). 


1.7. Frommer’s Method 


The equation 
<Q), P> = 41;P1 + 42jP2 =F (6) 


is homogeneous in the variables p,, p, and r. We can normalize any one of the 
three variables, and then equation (6) will define a line in the plane of the other 
two. Depending on which of p,, p, or r is normalized (i.e., set equal to unity), 
we will have three possible geometric interpretations. In each case the vector 
Q; = (41;,42;) corresponds to a line. We considered the case r= 1 in the 
previous section; here, we look at the case p, = 1. In this case, every vector 
Q; = (41}.42;) € D defines a line 0, in the (p,r) plane according to the equation 


Qj + dap =r. (7) 
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For a fixed vector P =(p,,p,) with py #0, we have P = p,(1,p2/p,); thus, 
<Q;,P> = Pi(41; + 92jP2/P1) = Pir;, where r, is the ordinate corresponding to 
the abscissa p = p,/p, on the line Q,. Therefore 


P, maxy,; , ifp, >0, 
max (Q;, P> = max pyr; = i 
¢ p,miny, , ifp, <0. 
J 
The boundary subset Dp, consists of those points Q € D for which r = r*(p) = 
max,r; if p, > 0, or r = r,(p) = min,r; if p, < 0. Geometrically, this means that 
for every value of the abscissa p in the (p,r) plane, we can find a largest r*(p) and 
a smallest r,(p) among the points of the lines Q,. Depending on the sign of p,, 
we choose either the point (p,r*(p)) or (p,r,(p)) and select those lines Q; which 
pass through the chosen point. 

The elements of the set D which correspond to the selected lines determine 
the boundary subset Dp. The collection of points (p,r*(p)), for all p, is the open 
polygon ©); the collection of points (p,r,(p)) is the open polygon ®”., Each 
of the open polygons is convex, and convex towards the other. The edge of 
@ = ©) | &” corresponds to the single-point boundary subset Dp, i.e., to the 
vertex of the open polygon 6/. The vertex of the open polygon ® corresponds to 
the multi-point boundary subset Dp—i.e., to the edge of the open polygon oF. 


Example 9 (see examples 7 and 8). For D = {Q,,Q,,Q3,Q,} where Q, = (3,0), 
Q, = (0, 3), Q; = (1, Ll), and Q, = (1,2), we have the four lines (see fig, 9) 


1=B3=rn, Q,={l+p=r}, 
Q,=(3p=r}, Qg={l+2p=r}. 


Be Q, 


Fig. 9 
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The bold line represents the open polygon ©“. Its three edges correspond to 
vertices Q,, Q,, and Q, of the Newton open polygon in figure 6. The two vertices 
of the open polygon ®™” correspond to the edges [) and I" in figure 6. The 
dashed line in figure 9 is the open polygon ®*?, the two edges and single vertex 
of which correspond, respectively, to vertices Q, and Q, and to edge /{” in 
figure 6. 

This method of finding the boundary subset D, with help of the open polygon 
@ was first developed by Frommer [1928] for the study of differential equations. 
The development of this method has been carried out by Kukles [1958, 1964] 
and his students, as well as by Andreev [1970], Andreev and Gleiser [1972], and 
others. Note that the third possible normalization of equation (6), setting p, = 1, 
may be carried out in a similar fashion. 

In comparison with the open polygon oJ and the dual polygon 2, Frommer’s 
open polygon @ has the disadvantage that it is not symmetric relative to the 
coordinates g, and q, (or p,, p2). It is therefore difficult to construct a multi- 
dimensional generalization of Frommer’s open polygon. Moreover, it is hard to 
determine the changes in Frommer’s open polygon under power transformations 
(see below). From now on, therefore, we shall adopt the geometric interpretation 
of Newton. The use of Newton’s polygon in solving differential equations was 
introduced by Briot and Bouquet [1856] and applied by Horn [1894], Dulac 
[1904], and others (see the surveys in § 20 of Dulac’s monograph [Dulac, 1912] 
and in §4, section 7 of Bieberbach’s book [Bieberbach, 1953]). Apparently, 
Frommer did not read these works in the original, but instead read brief reviews 
written by Painleve [1905]. Frommer therefore developed his own, less con- 
venient method. One gets the impression that the many enthusiasts of “Frommer’s 
method” never read the works of his predecessors. This is evident from their 
devotion to Frommer’s open polygon, the absence of appropriate references, and 
the duplication of earlier results. For example, Dulac [1904] studied integral 
curves of zero and infinite order, but Frommer [1928] did not consider them at 
all, while Kukles [1958], devoted a series of works to the question, apparently 
not realizing it had already been studied (see also Andreev, 1970.) Beklemisheva 
[1971] suggests not normalizing p,, p,, or rin equation (6), considering the cone 
M = {(P1, 22.1) 41jP1 + 92jP2 <1,j = 1,...,m} in real three dimensional space. 


1.8. Affine Transformations 


We next consider the linear coordinate transformation in the R3 plane 


Pi = %1P1 + %2P2 » P = %21P1 + %22P2 
with the non-singular matrix 


O11 12 
2=( &), 
Ho, 422 


(i.e., det « # 0). In vector form, the transformation is written 
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P=oP. (8) 


The scalar product <Q, P> for Q ¢ R? and P € R3 will be invariant under 
the transformation if we effect the corresponding coordinate transformation in 
R? given by 

P=a'*Q, (9) 


where a* is the transpose of the matrix «. By the property of the scalar product, 


<BO, P> = <O, B*P>, we have 
<Q',P’> = <a-1*Q, aP> = (Q,a-*aP> = <Q,P> . 


Thus, under the corresponding transformations (8) and (9) in the R3 and R? 
planes, the scalar product of vectors in those planes is invariant. These planes 
are therefore conjugate or dual to each other. 

The affine transformation (8) is a one-to-one mapping of the plane R32 onto 
itself. Under it, lines are mapped onto lines. For example, for a fixed vector 
Q;, the line ¢<Q,, P> = c = const maps onto the line ¢Q), P> = c. Moreover, all 
linear inequalities are invariant. Let a set D € R? be given, with corresponding 
sets J, U, J}, Df and U! constructed as described earlier. Let D’ be the set 
obtained from D by applying transformation (9) and let the corresponding sets 
be I, U’, 4", D@, and US. Then 


=e, | ad = ld Fas ; Di” = at Die : 
U'=aU, UM = aU® , 


That is, all of our earlier constructions are invariant under transformations (8) 
and (9). 


Problem: Let J; be an edge (or line segment). Find a transformation of the 
form of (9) such that the edge Jj" = a*-"J5"" is parallel to one of the coordinate 
axes qj OF q5. 


Solution. Let the vector P € R3 be perpendicular to J}. If P’ = aP lies along 
a coordinate axis, then the perpendicular J;‘"” lies on the other coordinate axis. 
It is therefore sufficient to find a matrix a such that one of the coordinates of 
the vector «P vanishes. Such a matrix is, for instance, 


ei i) (10) 


\—P2 Pi 


1 0 
o- (7, a (11) 


where a = p,/p,. Then the vector P’ = (p,,0) lies along the coordinate axis pj, 


or 
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Fig. 10 


Fig. 11 


and the edge J;*”’ is parallel to the axis q; (figs. 10 and 11). We see that the inverse 


of matrix (11) is 
~s 1 0 
ies a hes 


1 0 if 6 
B=(; ap men a (oe ye 


that is, under multiplication of such triangular matrices, the lower left-hand 
elements simply add. 

Now consider a more complicated case of the previous problem. Let the set 
D lie in the first quadrant of the R? plane, and let us require that D’ = «*"'D 
also lie in the first quadrant. We can assume that P <0. Since D’ = a*'D, 
the requirement that the set D’ lie in the first quadrant will automatically be 
satisfied if all the elements of the matrix «~! are non-negative. But if P < 0, then 
a = p,/p, is positive, and matrix (11) satisfies the requirement. 


In general, if 
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1.9. Unimodular Transformations 


If the elements «, of the matrix « are integers, and if deta = +1, then 
a is unimodular. The inverse of a unimodular matrix is also unimodular, and is 
given by 
od —o 
an} -( a2 2) dete ; (12) 


7H 411, 


The main property of transformation (8) with a unimodular matrix « is that 
it is one-to-one on the integral lattice Z7. 

Let us now solve the problem of the last section in the class of unimodular 
transformations (8), assuming that D c Z”. In this case, all the vertices of the 
open polygon 6/ are integral. Thus, given an edge /;") with integral endpoints, 
we must find a unimodular transformation (8) under which its image, 5°”, is 
parallel to one of the coordinate axes. 

Let R = (r,,r2) be a single vector along the edge J;', ie., R is given by the 
differences between neighbouring integral points of the edge J;"”’. Then the vector 


P = (P,,P2) = (ra) (13) 


will be perpendicular to J;), where the integers r, and r, are relatively prime. 
We must find a unimodular matrix @ such that one of the coordinates of the vector 


OP = (01) Py + %12P2,%21P1 + %22P2) (14) 


vanishes. To simplify the following calculations, we assume that P > 0. We shall 
find a sequence of unimodular coordinate changes whose end result will be the 
desired transformation. For convenience, let p, > p,. Division with remainder 
yields 

P2=4,P,+p2, 


where 0 < p; < p, and a, > 0, and p}, a, are integers; p, = 0 only if p, = 1, 
since p, and p, are relatively prime. We make the transformation 


, 


Pi=P; P2 = —4,P, + P2 


ee ge 
A Ne Oe 


Under this transformation, the vector P transforms to a vector P’ = (p;, p3). If 
p, = 1, then p, = 0, and we have found the desired transformation. Otherwise, 
if p, > 1 then p, > 0 and the numbers p; and p are relatively prime; we have 
not yet found the desired transformation, though the coordinates of P’ are in 
a sense “smaller” than those of P. Let us continue the process of diminishing 
the coordinates. We divide p; by p34, 


with the unimodular matrix 
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Pi =42P2+ Pi » 


where a, and pj are integers, 0 < p{ < p2, a, > 0, and pj is the remainder. 
We have p3; = 0 only when p} = 1. We apply the unimodular transformation 


" ; ; 
P, =P, — 42P2 


P2 = P2 


1 -a, 
7 (eee. 


Applying this transformation to P’ yields a vector P” = (pj, p3) > 0. If p, = 1, 
then p{ = 0 and the desired transformation will be the product of the two 
transformations: 


with matrix 


A= O20, . 


If p, # 1, then pY > O and pj and p} are relatively prime. Again, the coordinates 
of the vector P” are “less” than those of P’. We need to continue the process. 
Suppose that the k-th step yields a vector P” = (p™, p*’) > 0. If the larger of 
the coordinates equals one, the other must be zero. Then the desired matrix a is 


1 = Oy Op-y--- yy 


and the vector «P is simply one of the unit coordinate vectors. If, on the other 
hand, the larger of the two coordinates of P is not unity, then the other is 
non-zero, since they are relatively prime. Dividing the larger coordinate by 
the smaller, we make the unimodular transformation P*t”) = a, ,, P™, where the 
vector P+!) is “smaller” than P™., Continuing this process, at some n-th step 
we obtain the unit vector P”. Then 


Oh = Wy py Hy y (15) 


and the vector P” = aP is a unit vector; our problem is solved. 
Note that each matrix , is triangular, with ones along the diagonal and with 
the position of the third element dependent on the parity of k: 


1 0 1 -a 
ie i) (, ae ia 


Example 10. Let p, = 5,p, = 17. Then p, = 3p, + 2,i¢.,a, = 3,p; = py = 5, 
p; = 2. Further, p; = 2p, + 1, so that a, =2, pj =1, p} =p; =2. Finally, 


m 


P; = 2pj, and a, = 2, py’ = 1, and p;’ = 0. Thus, 


ae i (10 
TE ag pe) Fe hg a ee ed Hee 
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7 2 
Rmeyes% =| 15 5)? 


aP =P” =(1,0). 
Note that by (15) 


As a result, 


and that the matrices «,' have the form 


Co te a) 


each «,' has all non-negative elements. As a result, the matrix a! has non- 
negative elements. It is essential here that P > 0. 

In this construction we have used the algorithm of successive division with 
remainder, i.¢., the Euclidean algorithm. In fact, we have sought a unimodular 
matrix « such that, for two relatively prime numbers p, and p., the vector (14) 
should be a unit vector. That is, 


Pr t+%2pP2=1, 
21Py + %2p2=90, (17) 
det © = 4 1%22 — 2%, = 41. 
The second of these equations implies that 
%21 = —P2; %22 = Pi - (18) 


Then the first and third equations of (17) are equivalent. The solution of the 
first for integers «,, > 0 and a,, <0 is simply a matter of applications of the 
Euclidean algorithm as seen above. 

This process can also be interpreted as the expansion of the ratio y = p,/p2 
into a continued fraction 
_ Pi l 


= My + 
P2 ‘, l 


y 
i 


1 
+ 


ay-1 + — 


n 


Here, ag is the integral part of the number }, a, is the integral part of (y — a)‘, 
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a, is the integral part of ((y — a,)7! — a,)', etc. Thus, in order to express y as 
a continued fraction, we must separate y into its integral part a, and a remainder, 
then separate the inverse of the remainder into an integral part and a new 
remainder, invert that remainder and find its integral part, and so forth. The 
theory of continued fractions (see Khinchin, 1961) tells us that the continued- 
fraction representation of a rational number is finite, and that the number 


is such that 


y- p= -—_, (19) 
4n-14n 


where q,_; and q,, are the denominators of p and y respectively, and o = +1. 
Thus, q, = p2. If we write p = p,_,/G,-,, then, in agreement with (19), 


P19n-1 — P2Pr1 =O = +1. 
To satisfy equation (17), we need to set 
O11 = GQn-10 5 O12 = —Pp1o . (20) 


Example 11 (a continuation of example 10). Let p, = 5, p, = 17. Then 


Pee 2: ah 
34a 7 PralGn-s : 


Since 


o = I. In agreement with (20) #,, = 7, @,, = —2. This, along with (18), gives us 
the matrix 


as we found earlier. 


Remark. If «P = (1,0), then &P = (1,0) for 
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mk Xo1 X22 
zmatk(™ “al 


where k is an arbitrary integer. 

We have described here solutions only to problems with positive P. If P < 0, 
we simply find the matrix « which solves the problem for — P. But if p,p, < 0, we 
find the unimodular matrix « for the vector (|p,|, |p2|); then the matrix 


be SENP, %42 2) 
OH, SNP, %22 SQN p2 


will be the unimodular matrix that causes one of the coordinates of P to vanish. 


Exercise 1. Let p, = 5, p, = 7. Find a unimodular matrix « such that one of 
the coordinates of «P vanishes. 


§2. Zeros of an Analytic Function 


2.1. A Simple Point 


Let the function f(x,,x,) be analytic at the point x, = x, = 0. Then f may 
be expanded in a Taylor series 


f(X1,%2) = > Sag: XE XP > (1) 
919220 
which converges absolutely in some neighborhood Y = {x,, x2: |x,|,|x2| <6} 
of the origin. In expression (1), q, and q, are non-negative integers and the 
coefficients f,,,, are real or complex constants. The values of x, and x, may 
likewise be either real or complex. The presentation given here will be for the 
case of a real-valued function f of two real variables x, and x,; however, 
the entire discussion applies equally well to the complex case. 
Let us write the first few terms of the Taylor series for /: 


Sf = foo + SioX1 + forx2 + faoxt + firX1%2 + forxd + °° 
Now let fo = 0 (so that f(0,0) = 0), and consider the equation 
I(X1,%2) =O. (2) 


The collection of those solutions of equation (2) which lie in the neighborhood 
% forms an analytic set (see Fuks, 1962)—specifically, a plane analytic curve 
passing through the origin. This section (§ 2) aims to study the location of this 
curve in the plane R@ in a sufficiently small neighborhood of the origin. 

If [fiol + |fo1| #9, then X = 0 (the origin) is called a simple point for the 
curve defined by equation (2); otherwise, the origin is called a critical point. 
We begin with the solution of equation (2) in the neighborhood of a simple point, 
assuming that the coefficient fo, is non-zero. 


Theorem 1: Let a function f be analytic at the origin, with f(0,0) = 0 and 
So: # 0. Then equation (2) has an analytic solution 


38 Chapter I. Foundations of the Local Method 
. k 
X2 = b(x,) = Y bhxt . (3) 
k=1 


Proof. We apply the method of undetermined coefficients. After substituting 
the solution (3) into the expansion (1), equation (2) must be an identity in x,: 


00 92 
E haart (¥ bat) =o. 


91-92 


Multiplying out and collecting terms, we obtain 


y (Di Fararbey +++ bs,)x1 = 0 (4) 


where the indices of the internal sum are restricted by 
qth to tka (5) 


Identity (4) is satisfied if the coefficient of x} vanishes for all | = 1, 2, ... that is, 
if each of the internal sums vanishes. In every such sum, only the term 9,5, 
contains the coefficient b, for k = 1; in all other terms, the indices of the b, are 
strictly less than J, since in (5) q, 2 0 and k; > 1. We denote the sum of these 
remaining terms by c,. Then identity (4) is equivalent to an infinite system of 
equations 

Ford: + C(by,.--, 5-1) = 9, [= 1,2,... 


From this system, we determine that b, = —c,/fo, for each successive I. 

By means of the method of majorants, we can show that series (3) converges 
for sufficiently small |x,|. This is the Cauchy implicit function theorem (see 
Goursat, 1933, § 184). We will not, however, prove the convergence of series (3) 
here. In general, we will give results on the convergence and divergence of the 
series without proof. 1] 


The solution x, = b(x,) of equation (2) is a simple curve in the (x,, x.) plane, 
passing through the origin. 

Several methods may be used to calculate expansion (3). For example, we 
can express the polynomials c, as functions of the coefficients b; and f,,,,. 
Alternatively, we can employ successive approximations to find the series (3). 
To do so, we make the coordinate change 


x2 = bx ty, 
in the original equation (2). Then equation (2) takes the form 


Ap yi) =O 5 


where we must seek a solution 
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yas » b,x . (6) 
k=2 
In subsequent approximations, we have 
Ve = Deer xt + Yea - 


Example 1. Let f = x, + x. + x,x,. In the notation of (1), fio = fo; = 
fir = 1 Soo = fo2 = Oand f,,,, = Oifq, + q2 > 2. We substitute x, = b,x, + y, 
into the expression for f and obtain the equation 


X, + b,x, + yy, +X, (b,x, + y,) =O. 


It is evident from expression (6) that y, is of greater than linear order in x,. Hence 
we can isolate all the terms of less than second order and obtain the equation 
x, + b,x, = 0; that is, b, = —1. We thus have the equation y, — x? + y,x, =0. 
For a second approximation, y, = b,x? + yz, we have 


f= bx? + y, — x? +b x? + x,y. =0. 


From the terms of second order in x,, we obtain the equation b,x? — x? = 0; 
hence b, = 1, and f = y, + x3 + x,y. Proceeding similarly with the third 
approximation y, = b;x? + y3, we find that f= b,x} + y, +x} + b,x44+ 
X,;y3 = 0. The terms of least order (here, third) give b,x} + x? = 0; that is, 
b,; = —1. Proceeding further will give us the series b = )}?_, (—x,)*. On the 
other hand the equation x, + x, + x,x2 = Ois easily solved as a linear equation; 
its solution is x, = —x,/(1 + x,). Writing this solution as a Taylor series about 
zero yields the same series, b, that we found above. 

Before studying the behavior of analytic curves in the neighborhoods of 
critical points, let us examine some of the local properties of curves in the plane, 
as well as the asymptotic behavior of an analytic function on such curves. 


2.2. The Order of Magnitude of a Function 


Let t be a real parameter. We will consider the behavior of a function of t, 
defined for t > ty, as t > +00. By o(1) we will denote any function of t which 
approaches zero as t > +00. If 


p(t) = br?(1 + o(1)) , (7) 
where b and p are real numbers, then as t > co we have 


ie ifp<0, 
b, ifp=0, 


(t)> 
~ ee p>o. 


Let 
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|y| 
os 
|e] 
0 t 
Fig. 12 
|p| 
|P| 
Iyil 
0 t 
Fig, 13 


,(t) = by t?*(1 + o(1)) . 


If p, > p, then for sufficiently large t we will have |@,| > |g|, and the graphs of 
the two functions will look like those in figure 12. If p, = p and [b,| > |b}, then 
|~~,| > |@| and the graphs will be similar. As an example, consider » = 1000 173 
and , = ’; this is illustrated in figure 13. Note that for very large t, these 
curves behave just like those in figure 12. Note also that when p, > p, 


eto, =b770 + o(1)) , 
0, = bb, r?*(1 + o(1)) . 
Finally, note that under the parameter change 
t= ct*(1 + o(1)) , (8) 


where c>0 and «> 0, function (7) becomes a function *(t*) = g(t) = 
beP?t*?*(1 + o(1)) = b*r*? (1 + o(1)), having the same form, but with a new 
exponent p* = ap and with a new coefficient b* = bc’. 


2.3. The Order of Magnitude of a Plane Curve 


We now consider a curve ¥ in the x,, x, plane, defined by the pair of 
parametric equations 
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x2 


Sr 


Fig. 14 


¥. e = byr(1 + ofl) , 0) 


“lx, = b,1?2(1 + of1)) . 


We call any curve defined by such a parametrization a curve of class W. We will 
examine the behavior of such curves in the neighborhood of the origin X = 0; 
then p, <0 and p, <0. We also assume for convenience in the following 
discussion that b, > 0 and b, > 0. Graphs of curves of class W are shown in 
figure 14. 

We call the vector P = (p,, p,) the (vector) order of the curve F. If we replace 
t with a new parameter t*, according to formula (8), the equations defining ¥ 
become 

x, = dycPr*P1%(1 + of1)) = ber**F(1 + o(1)) , 


X_ = by cP2¥?2*(1 + o(1)) = b¥r*?2(1 + o(1)) . 


That is, F is a curve of class W with respect to t*, and its vector order with 
respect to the new parameter is 


P* = (pt, p3) = («p1,%p2) = aP . 


Thus, to each curve ¥ defined with respect to different parametrizations, given 
by (8), there correspond vector orders P* which lie on the ray P = {P*: P* = 
aP,« > O}. 

For example, the graph of the curve 


x,=t?, 


F: a = (10) 


appears in figure 14; the curve’s order vector P = (—1, —3) and the ray P are 
represented in figure 15. Curve (10) is given by the single equation x, = x}. In 
general, if we define ¥ using the variable xj‘ as the parameter t, then the first 
coordinate of the vector order P is p, = —1. 
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Fig. 15 


Now suppose that, in addition to the curve (9), we have the curve 


Zz. fe = b,f(1 + o(1)) , 
“" Lx, = 6, 72(1 + o(1)) . 


Let us consider the curves ¥ and ¥ in the x,, x. plane (figure 14) as well as the 
corresponding order rays P and P in the p,, p, plane (figure 15). We note that, 
ina sufficiently small neighborhood of the origin, F lies to the right of ¥ as long 
as P lies to the right of P. This is easy to show if we choose x;! as the parameter. 
For example, let ¥ be given by formula (10), and tet ¥ be given by 


F: fa mtr 


ix2.=ti*, 


ie. x, = x}; then P = (— 2, —3) (see figures 14 and 15). There is thus a definite 


connection between the geometry of curves in a small neighborhood of the origin 
in the x,, x, plane and the geometry of their vector orders in the p,, p plane. 
Next we turn to general curves of the form 


xX, = bt", 
X,=b 1” . 


We say that such curves are of the class Wo. Every curve (9) of the class W can be 
confined between two curves of the class Wo in a sufficiently small neighborhood 
of the origin. 

Let us consider two such curves of class Wo: 


F*: {x = b,t?', x. = (by + €)t?*} , 
Fy: {x = byt?) xy = (by — €)t??} . 


Comparing them with curve (9), we see that the new curves have exactly the same 
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order P = (p,,p,) and differ only in the coefficients b,, b, + s, and b, — « of 
the x, coordinate. The result of this change of coefficients is that the curves F * 
and F,, lie, respectively, just above and just below the curve ¥. 


Exercise 1. Explain the behavior of points on curve (9) as t > 00, particularly 
its dependence on the vector order P = (p,, pz) of the curve (consider those cases 
in which P lies in different quadrants or along coordinate axes of the p,, p, plane). 


2.4. Asymptotics of a Function on a Curve 


We consider the asymptotic behavior of a function f(x,,x,) on a curve of 
form (9) as t-> co. We begin with the case in which f is a monomial ie., 
f =x?'x?. Then along a curve F, 


Ie = bd bg2 7 P 14st P2d2( J + o(1)) . 
The order of this function with respect to t is 


91P1 + 92P2 = <Q,P> . 


Next, consider the finite sum 


FA 
with real coefficients a;. Then 
fe = ¥, ajbPsbgeic?4s* P2921 + o(1)) . (11) 
j= 


The exponents of t in the terms of this sum will be different for different /; if 
we write Q; = (q1;,42;), then the exponents can be written as scalar products, 
<Q), P>. 

Since the terms of sum (11) with larger exponents will have larger magnitudes 
as t — oo, we can select the indices j of those terms with the largest exponent: 
r = max; Qj, P>, P fixed. Let this maximum be reached for terms with indices 
J =Jis+-+yJs. Then 


ene for j =Ji,..oJs 3 (12) 


<Q,P> <r forjAjyy sds - 
Let us select all those terms for which ¢Q;, P> = r, and create a new function 
is s 
f(x1,%2) = ¥ Aj, XP HXPe 
k=1 


Then on the curve ¥ 


fe = fe + o(l)t" = f(b,,b,)t” + toll) . (13) 
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The function f is called the first approximation to f for the vector order P or 
the truncation of the function f for the (vector) order P. Clearly, such a construc- 
tion uses only the order P of the curve ¥. Since the truncation f is determined 
by the leading term of f on the curve ¥, we must know how to find the first 
approximation to f, considering the fact that different orders P will correspond 
to different truncations f of the function f. 


2.5. Finding the Truncation 


Let D = {Q,,...,Q,,} be a set of vector exponents for the sum f. For a fixed 
vector P, the truncation f will consist of terms a;x}'4x?+whose vector exponent 
Q; = (41; 42;) Satisfies conditions (12). These conditions determine a boundary 
subset D, of the set D. Therefore, 


f= Yo ayxfuxpe (14) 
Q;eDe 

The problem of choosing the boundary subsets D, was discussed in § 1, where 
we presented the following geometrical method: from the set D we construct 
the Newton polygon F’, whose boundary @/’ consists of vertices J}'° and edges 
T;'Y; these correspond, respectively, to the normal cones U!® and U!). The 
boundary subset Dp is the set D{? = Do J;, where P € U\®. For a fixed US, 
to all Pe U® will correspond a unique boundary subset D, = D, and thus 
a unique truncation f= fe of the polynomial f. Hereafter, the normal cone 
U® will also be called the cone of the truncation f = f a), 


Example 2 (see examples 1 and 3 of §1). Let f=x, +x, + x,x,. Here 
Q, =(1,0), Q, = (0,1), and Q3 = (1, 1); D = {Q,,Q2,Q3}. The Newton polygon 
T and the division of the R3 plane into the normal cones U{® are pictured in 
figures 2 and 3, respectively. If P = (— 1, — 1) then P € U\’", and Dp = {Q,,Q>}. 
Thus, f = X, + Xp. 

A general scheme for constructing truncations and their cones may be de- 
scribed as follows. From the sum (1) we can construct a corresponding set D 
consisting of those points Q = (q,, q2) which are the vector exponents of the terms 
in the sum f with non-zero coefficients, f,,,, # 0. The set D is called the support 
of sum (1), and is sometimes denoted by supp f. From the set D we can construct 
the Newton polygon J = /(f) and find its vertices and edges J;. For each of 
the /;, there is a corresponding boundary subset Dj? = /;® > D. This subset 
D( gives us the truncation fo by formula (14). Finally, for each truncation we 
can construct a cone U%); this is the set of all vector orders P for which J is 
the first approximation to f. The described sequence of operations may be 
visually represented as the following scheme: 


t { #) 
t t 
Do {1%} > {Di} > (UP) | 
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Exercise 2, Find all of the truncations ff and their cones U? for the 
following polynomials f: 

1) xy + XQ +X, X23 

2) x2 + x3 — 3x,x. + 1x5; 

3) xe + xF 4X, x3. 

This entire construction is applicable to infinite sums f(x,, x). For example, 
if f is a series (1) with integral, non-negative exponents, evaluated along curves 
(9) which approach the origin, then P <0 and, in agreement with Section 1.5, 
the number of truncations f is finite. They correspond to the vertices and edges 
of the part of Of between the vertices Q* and Q,. Formula (13) also applies. 
Note that the truncation fi, corresponding to the vertex /;‘ = Q, is the 
monomial f,, a x? x. But the truncation bie corresponding to an edge of 6F, 
will be a quasi-homogeneous polynomial, since all of the exponent vectors Q lie 
on a single line. 


2.6. Power Transformations 


~ (ee 2 
X21 %22 


be a matrix with real elements and det « # 0. Then the power transformation 


Let 


y= = xT x gre 7 


15 
Yo = XY x5? 7 
has the inverse 
— phrryhi2z 
xX, =yi'ys", ge 4 a) (16) 
x2 = yf yp ope 


where 8 = « ', In fact, transformations (15) and (16) are linear with respect to 
the logarithms of the coordinates: 


Iny, =4,,Inx, + 4,,Inx, , 
In y. = &,Inx, + a,Inx, ; 


i. = B,,Iny, + Bz Iny, , 
Inx, = B,,Iny, + Boz Iny2 . 


and are clearly mutually inverse. We now discuss some properties of the power 
transformation (15), 

1) Transformation (15) is always well-defined and single-valued in the first 
quadrant of the Rj plane with coordinates x,, x, or );, yz. In the other 
quadrants, the transformation is real only under certain restrictions on the a,. 
The transformation is always well defined on the complex plane C@, but it is not 
always single-valued there. 
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2) Under transformation (15), the monomial x#' x4? transforms to 
xf) x9 = (yh18 yB12)91( yB21 Baz yae = yhrrds + B2142 yBi2di + B22d2 , (17) 


where f is given in (16). That is, the vector exponent Q of X transforms into 
the vector exponent 


Q' = B*Q (18) 


of Y = (y,,y2), where f* is the transpose of matrix 8. Thus, under the non-linear 
transformation (15), the vector exponents undergo a linear transformation. This 
holds for all of the terms in a sum of monomials 


f= Flake (19) 
geD 
which, under transformation (15), becomes the sum 
= i fovt'yP . (20) 
a 


where Q’ = £*Q and D’ = f*D. That is, the power transformation (15) leads to 
a linear transformation (18) in the Rj plane of vector exponents Q. As we saw in 
§1, the geometrical constructions of sets D and D’ are related by that linear 
transformation; the polygon /(D’) is just ”’ = B*/, and each of the vertices or 
edges I; < I corresponds to a vertex or edge J; < I’. Therefore, the trunca- 
tion F(X) transforms under (15) into a truncation fer that is, the operation 
of truncating sums commutes with the power transformation. 
3) Under transformation (15), the curve of the class W 


g. Pr = bit(1 + of) , 
" [xa = b,t4(1 + o(1)) 


with positive b, and b,, transforms into the curve 
Ys = (yr? f4(by 772) 2(1 + of) = Bf byrectP1*2P2(1 + ofl) , 
V2 = (by rP#)*4(b,7?2)"22(1 + ofl) = bf by227%21P1 *422P2(1 + o(1)) 
This curve is also of the class W,, but with the vector order 
P=aP. (21) 


Thus, the power transformation (15) induces a linear transformation (21) in the 
R? plane of vector orders P. This transformation is the dual of the transformation 
(18). The R? and R3 planes are thus dual spaces, and the scalar product <Q, P) is 
preserved under the power transformation. In particular, the cones of truncations 
U® of sum (19) are transformed into the cones of truncations U’ of sum (20). 
In general, all geometrical objects in the dual planes R? and R3 undergo linear 
transformations (18) and (21), and all of their mutual properties are preserved. 
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In what follows, we will mainly use power transformations of two kinds: those 
for which the matrix « is triangular or unimodular. We therefore present here 
a few properties of such transformations. 


The First Kind: Let the matrix « be triangular: 


where @,, is rational, i.e, 7, = n/m. Then 


oe 1 0 
a em nT 


If m, the denominator of «2,, is odd, then transformation (15) is defined and 
one-to-one for all real, non-zero x,, X2, y;, 2 (ie. in the R2 plane). If m is 
even, then the real transformation (15) is only defined for x, > 0 (or y, > 0). In 
this half plane, it is one-to-one everywhere but on the coordinate axis x, = 0 
(or yz = 0). 

In the complex plane C2, the transformation (15) is one-to-one only for 
integral values of «,,. 


The Second Kind: The matrix « is unimodular (ie., the a, are integers and 
det « = +1, see section 1.9). Such power transformations (15) are one-to-one 
in real and in complex domains away from the coordinate axes; this is the 
distinguishing property of power transformations with unimodular matrices «. 

Moreover, any monomial x7! x3? with integral exponents q, and q, transforms 
into a monomial y#' y%2, again with integral exponents qj and q}. This also holds 
for the inverse transformation. This is because unimodular transformations (18) 
map the integral lattice Z? onto itself. 

Power transformations were introduced and studied by the author [Bruno, 
1962, 1965] for multi-dimensional problems. In the same papers, polygons 
(polyhedra) and cones of truncation for systems of differential equations were 
introduced. In the first version of the second paper [Bruno, 1965], these topics 
were treated in depth, but editorial requirements forced the article to be cut by 
one third, and the examples and geometrical interpretations were removed. Some 
of these were later restored, and can be found in the author’s preprint [Bruno, 
1973b]. 


2.7. The Critical Point 


We return now to finding solutions of equation (2) in the neighborhood of 
the critical point x, = x, = 0. Any number of branches of the curve made up of 
solutions of equation (2) may pass through the critical point. We need to know 
how to distinguish these branches and how to find them with an arbitrary degree 
of accuracy. 
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Remark. We can assume that the coordinate axes x, = 0 and x, = Oare not 
solutions of equation (2). In fact, if this is not so, then f = fox}'x¥, where r, and 
r, are non-negative integers, and f, is a function, analytic at the origin, which 
has no factors of the form x, or x,. Then the solutions to equation (2) satisfy 
one of the equations 


x=0, xp=0, fo=9. 


The first two have trivial solutions, and we proceed to solve the third equation, 
which does not have the axes as solutions. 

We will seek solutions of equation (2) in the form of curves (9) of the class W. 
Clearly, we are only interested in those curves which approach the origin as 
t > oo; that is, those for which P < 0. 

The third quadrant {P <0} of the R3 plane is divided up into the normal 
cones U™ of the vertices and edges of the Newton open polygon / for sum (1). 
Let P € U\®; then, in accordance with (13), on curve (9) we have 


fe = f(b,,b,)t" + t70(1) . 


Consequently, as t + oo along the curve (9), the ratio f¢/t” must approach the 


value f(b, bp). If the curve (9) satisfies equation (2), then this value must 
vanish. That is, the coefficients b, and b, must satisfy the equation 


f(b) =0, B40, B40. (22) 


If d = 0 (that is, 1; is a vertex), then the truncation fo consists of a single 
term: J = axf'x§?. In this case, equation (22) has the form ab?'b3? = 0, and 
has no non-zero solutions. Thus, no curve (9) whose vector order, P, lies in the 
cone of vertex U{® can be a solution of equation (2). That is, vertices do not 
provide us with solutions, and we must turn in our search for solutions to the 
edges Jj") of Fr. 

Ifd = 1, then FOC, ,X2) is a quasi-homogeneous polynomial. We can exploit 
this quasi-homogeneity of f by applying a power transformation (15) under 
which the edge Jj‘? becomes an edge /;”, parallel to a coordinate axis. We 
considered such a problem at the end of §1. The vector P’ = «P will lie on 
a coordinate axis. The power transformation matrix « may be taken to be either 
triangular (Section 1.8) or unimodular (Section 1.9). Corresponding to the kind 
of the power transformations used, there are two different methods of solving 
equation (2). 

The first method employs power transformations with triangular matrices «, 
and seeks a solution of the form 


ia =ot! (23) 


5 
X2 H.-P 4 pl2)_—P foe, 


where the numbers p“ are positive, rational, and increasing with increasing k, 
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and o = +1. Ifall of the denominators of the p™ are odd, then in solution (23), 
o may take on either of its possible values; that is, the solution is defined for 
both positive and negative x,. If any of the p™ has an even denominator, 
however, the sign of o and x, is fixed. 

We must show that every branch of the solutions of equation (2) can be 
represented in this form, and that distinct branches have distinct expansions. 
The differences may only be in terms of very high order in the expansion (23). 
Then those branches for which the initial terms of (23) coincide lie very near each 
other in a neighborhood of the origin. 

The second method employs power transformations with unimodular matrices 
a and leads to expansions of the form 


Hy Hh + BHP 4, Pe 1,2, (24) 


where the p‘) are negative integers. 


2.8. The First Method 


We seek a curve 
t 


ee fk) = Ot, 
=" a: = bP" + o(1)) @) 


of the class W which satisfies equation (2). Here P = (—1, —p"), b} =o = +1, 
b, = b") 4 0. Equation (22) takes the form 


f(o,b") =0. (26) 


For each edge J;'”), we define the unit vector R, = (r,;,72;) to be the difference 
between neighboring integral points on that edge. We shall assume for definiteness 
that r,; <0 and r,; > 0. Then the numbers r,, and r,,; are relatively prime, and 
kR, = [°) — 5, where the positive integer k is the greatest common divisor of 
the coordinates of the vector /;\°) — 1}. Further, let the vector P, = (—r2),11)), 
P, is orthogonal to the edge J;‘") and lies in the normal cone US”. Finally, we 
introduce the number ), = —r,,/r;; then the vector P = (— 1, —y;) e US” and 
Y-1 > % > 0. 

Thus, the edge /;” corresponds to a unique exponent p'’) = ; in expansion 
(25). Since the coordinates of R, are integers, p" must be rational. We fix o 
(for definiteness let o = +1) and find all the nonzero roots b!!, ..., b& of 
equation (26). 

In this connection, if we are interested only in the real roots of equation (2), 
we can limit ourselves to finding the real roots of equation (26). Let bY be such 
a root. Then the first term of expansion (23) is already known. In order to find 
the second term, we make the substitution 


X_ = xP(b) + y). (27) 
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Then f(x,, x2) = g(x1, y), where g is a series of non-negative powers of x, and y, 
with the powers of x, rational and those of y integral. 

Reducing the series by a factor of x}, we obtain gy = g/x{; we must now find 
those branches of the curve go(x,, y) = 0 which pass through the origin. This is 
similar to our original problem, except that x, now appears in rational powers. 
If m is the denominator of p™, then x}/" appears in integral powers in the series. 
If this quantity is considered as a new variable, then the problem is identical 
to that with which we began. 

Now, if 0go9/6y #0 at the origin, then the origin is a simple point and, 
applying theorem 1, it is possible to find a solution y(x,) of the equation 
9o(x1, y) = 0 which is a series in powers of x}/". If dgo/dy = 0 at the origin, it is 
necessary to construct a Newton open polygon for go, select edges, find the 
exponent p and the coefficients b for each edge, and so forth. 

If the rational number p'!) has an odd denominator, the quantity t”"’ is 
defined for negative as well as positive t. It is therefore sufficient to consider 
the case of o = 1, since the signs of x, and t take on both values (+ 1). Solutions 
b of equation (26) do not depend on o. If, however, the denominator of p" is 
even, then t”” is a real quantity only for non-negative t. It is then necessary to 
consider separately the cases x, > 0 and x, < 0, corresponding to o = | and 
o = —1.In the second case (o = — 1), it is convenient to change the sign of x, 
by writing X, = —x,. In this case the solutions b® to equation (26) will be 
different for different o. 

In order to relate the Newton polygons /(f) and /(g), we split transformation 
(27) into two parts: 

x, = xP"z, (28) 


z=bMO4y. (29) 


Under transformation (28), every monomial ax?'x%2 becomes a term 
ax$1 +P" 742 — gx%iz%, That is, the series 


f= Vf X? xP 
is transformed into a series 
f =D fiart2% , 


where qi = q, + q2p'” and qi = q2. Thus, the set D(f’) is the image of the set 
D(f) under the linear transformation Q’ = «*~'Q, where the matrix 


1 po? 
#-1 8 ; 
: e 1 


Under this transformation, all the points Q are shifted parallel to the q, axis, 
to the right for g, > 0. The polygon /(f) is deformed into a polygon J(/’) 
whose edges and vertices are, respectively, the images of the edges and vertices 
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(1)! 
Ty 


l(h) 


Fig. 16 


of F(f). In particular, points Q lying on the edge J;"), which corresponds to 
a truncated series f, satisfy 


CO.P) = KE POST 


Le. 9, + q2p"? = r. After transformation (28) is applied, the points Q transform 
to 
Q! = (41 + 42P",42) = (1,42) , 


that is, the edge Jj") becomes the vertical edge J;’” along which qi = r, and the 
entire set D(/’) lies to the right of the vertical q, = r. If we now let fo = x;"f", 
then the points Q’ corresponding to the truncation ff of ff all lie on the q} axis. 
Figures 10, 11 and 16 illustrate the Newton polygons /(f), '(f’ ) and I(fo), 
as well as the disposition of the edge J;") which corresponds to f. Here, the 
truncation f of the function f with respect to the vector order P is eee 
into a truncation fj of Jo with respect to the vector order (— 1,0). This fe 
depends only on z: f(z) = xj"f(x1,x?"z) = f(1,z). Hence, the root b“) 40 of 
equation (26) is just a root of the equation 


fo(z) = 0 (30) 
and determines those points on the z axis which satisfy the equation 
fo(%1,2z) = 0. (31) 


Each remaining point on the z axis has some neighborhood in which equation (31) 
has no solutions. 

Note that under transformation (28), the point x, = x, = 0 blows up into 
a line, the z-axis. The curve x. = bx? which approaches the point x, = x, =0 
transforms into a curve z = bx?-”"””, which tends to infinity, to a constant, or 
to zero, if p < p', p = p", or p > p", respectively, as x, > 0. 

If b) is a simple root of equation (26) or (30), then dfj(bi)/dz # 0, and since 


Bf5(0, DM Y/dz = dfg(bW dz , 


ER ee 7 
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the point x, = 0, z = b@ is a simple point of curve (31). In agreement with 
theorem 1 the solutions of equation (31) in a neighborhood of the point x, = 0, 
z = bY) will take the form of an expansion of y = z — bY) in powers of x1/™. 
This expansion gives a branch of the solutions of equation (2) in the form of 
the series (23). 

If b® is not a simple root of equation (30), we must make the variable 
change z = b{) + y, construct the Newton polygon for the function go(x,, y) = 
f5(x,, 6% + y), and proceed to separate the branches, as above. This will also 
lead to expression (23). 

If equation (30) has no real root b", then the solutions of equation (2) have 
no real branches of the form (23) for the given order p'” (i.e., they have only 
complex branches). 


Example 3. Let 
Sf(x%1,X2) = x} + x3 — 2x,x,=0. (32) 


The curve defined by this equation is called the folium of Descartes. At the origin, 
Of/dx, = Of/dx, =0; it is therefore a critical point. We will investigate our 
equation in a neighborhood of this point. Here Q, = (3,0), Q, = (0,3), and 
Q, =(1,1); the Newton open polygon (fig. 17) has two edges, 1") and If? 
(see example 7 of § 1). We begin with the edge I"? > Q,, Q3. Here Q, — Q, = 
(—2,1) = R,, so P, =(—1, —2) and y = 2. The corresponding truncated sum 
is f = x3 — 2x,x,. To find the coefficient b™ we solve equation (26): 


f(,b)=1—2b=0, 


which gives us b) = 1/2. This equation is linear with respect to b, and b™ is 
a simple root. Therefore to this root and in general to the edge /{) there 
correspond a simple branch of the solutions of equations (32): 


xX = 4x?(1 + o(1)) - 


In order to find further terms in the expansion of x, in powers of x, on this 
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0 (3,3) 
0(3,2) 


0(3,1) 


Fig. 18 


branch, we must make the variable change (27) 


x2 =xi(g+y). (33) 
This gives 


f(%1,%2) = 91,9) 
= xp + xi(g + gy + 3y? + y*) — 2x13 + y) 
= —2xiy + ext + axty + dxty? + xty? . 
Dividing by x}, we obtain 
Go(X1,¥) = —2y + axi + axiy + 3xiy? + xy? . 


The set D(go) and the open polygon @/(go) are illustrated in figure 18. Here the 
point x, = y = Ois simple, since at that point ég)/dy = —2 # 0 (the simpleness 
of this point follows also from the fact that 5) = 1/2 is a simple root). According 
to theorem 1, solutions of the equation go(x,, y) = 0 can be written as a series 
y = y(x,). Specifically, the first term of this series can be found from the trunca- 
tion gg = —2y + (1/8)x?, corresponding to the only edge of the open polygon 
P(go). From the truncated equation 9) = 0 we find that y = (1/16)x3; therefore, 
for the solution of the whole equation we have y = (1/16)x}(1 + o(1)). Substituting 
this into equation (33), we obtain the expansion 


X2 =4x?4+ 4x7 4+--- 


for a branch F, of the solutions of equation (32). The edge J") corresponds to 
this single branch. 

Now let us consider the edge I) > Q,, Q,. Here, R, = Q, — 0, =(—1,2), 
P, = (—2, —1),y = 1/2. For the coefficient b", with o = 1, we have the equation 


FO(1,b) = b> —2b=0. 


The roots of this equation are b = 0 and b!?) = +,/2. We are only interested 
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Fig. 20 
in non-zero roots, so we take bf!) = +,/2. Each of these roots is simple. We 
therefore obtain the two branches: 
Fyixy=f2xX, +0755 Fyixg = —V/2xyto--. 
For real x,, X,, these curves are only defined for x, > 0. For o = —1, equation 


(26) becomes b? + 2b = 0, which has no real, non-zero roots. Therefore edge 
13" does not yield real branches for negative x,. The solutions of equation (32) 
in the x,, x, plane are shown in figure 19. 


Remark. In calculations, it is sometimes easier to replace the nonlinear sub- 
stitution (27) by the linear substitution x, = bx?) + y, and then to seek such 
expansions of y whose first term is a power of x, greater than p"). 


Example 4. Let f = x*+ + x?y? — 2x?y — xy? + y? =0 (here x =x, and 
y =X,). The Newton open polygon (figure 20) consists of a single edge, with 
R =(—2,1), P =(—1, -2), y =2, f= x* — 2x?y + y?. To find the coefficient b 
we have the equation f(1,b) = 1 — 2b + b? = 0, which has the double root b = 1. 
Recalling our remark above, we make the substitution y = x? + z. Then 


f(x, y) = g(x, z) = 2? — x8 — 2x3z — xz? + x® + 2x4z + x22? . 
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The open polygon f(g) consists of a single edge (figure 21) for which 
R=(—5,2) and P=(—2,—5), y= 5/2, g=z*—x°. From the equation 
9(c, b) = b? — o = 0, we find that b = +1 for ¢ = 1; both roots are simple. 
There are no real roots for 6 = —1. We therefore have the two simple branches 


z=txPoe-, x>0, 
or, returning to y, 
yHxtxP ae 


The two branches are now distinguished from one another, that is, we have 
obtained an expansion of y in powers of x for each branch which is different from 
the expansion for the other branch. Note that in our first step (when we just 
had the double root b = 1) the two branches were not distinguished from one 
another, since the first term of each expansion is the same for both branches. 
The branches’ behavior in the x, y plane is illustrated in figure 22. 


Exercise 3. Near the origin find and sketch the real branches of the solutions 
of the following equations: 

1)x3 —x? + y? =0, 

2Qxi+x?+y? =0, 

3) x3 + y? + x?y? =0, 

4) 2x+ — 3x?p + y? — 2y? + y* =0, 
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5) (x? + y?)? + 3x?y — y2 =0, 

6) (x? + y?)? — 4x7y? =0, 

7) y® — x3y? — © = 0, 

8) x?y — 2xy? + y? + x* + 2x3 y — 2xy? — y* + ax =0 for a = 15, 16, 17, 

9) ~x? + x* — 2x?y — xy? + 2xyt + p> =0. 

The method presented here goes back to Newton [1937] (see Chebotar’ev, 
1943, Walker, 1950, Vainberg and Trenogin, 1969, where different properties of 
the expansions resulting from this method are described). The method has one 
disadvantage: it leads to expansions in fractional powers of x,. In order to avoid 
this defect, Briot and Bouquet [1856] suggested that (18) be replaced by the 
following change of variables: 


x,=u™, x, =u%™(b™ +0), (34) 


where the integers m and n are, respectively, the denominator and numerator of 
the fraction p™). After such a transformation, a series f(x,, x2) becomes a series in 
integral powers of u and v. However, the inverse of transformation (34) generally 
includes fractional powers. That is, transformation (34) is not one-to-one away 
from the coordinate axes in the complex plane C2. Therefore, we now turn our 
attention to a second method of solving equation (2), a method based on power 
transformations with unimodular matrices. 


2.9. The Second Method of Resolution of a Singularity 


As above, let R, = (r;;,12;) denote the unit vector of edge J;'”. Then the 
vector P, = (—r;,1,;) lies in the normal cone U\” and is an integral vector. Let 
a be a unimodular matrix, such that 


P’ =aP,=(-1,0). 


a= @ ) (35) 
rr 2 


where the integers s, and s, are chosen to satisfy 


In Section 1.9 we showed that 


Sil, — Sor, = 1. (36) 


For any pair of relatively prime integers r, and r,, there is an infinite number of 
pairs of integers s, and s, which satisfy equation (36). The inverse matrix is 


poets ( n “4 , 37) 


Tt Sy 


We carry out coordinate change (15) with matrix « and, in agreement with 
the formulae of Section 2.6, we have 
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xXPxP = yen ~Fid2 78241 F842 — yh yP ; 
that is, the exponent of y; is 


4, = —<Q,P)> >r = —max ¢Q,P» . 
QeD 


Sum (19) is transformed into sum (20), and the polygon [ = [(f) becomes 
the polygon 7’ = /(f"). In particular, the edge Jj") becomes an edge [;°” 
orthogonal to the vector P’ = (—1,0). Consequently, Jj” is parallel to the 
ordinate axis, and the first coordinate of every point Q € J;” is the same number 
r (figs. 10, 11). For the corresponding truncated sums we obtain 


FM 2) = LO" va) = vi Fela) - 


For the remaining terms of expansion (20) q, >r. Therefore, the function 
tS’ (1, ¥2) may be divided by yt: 


fo(V1,¥2) = vis y2) - 


The division of f’(y,, y2) by yj corresponds to shifting the polygon J’ to the left 
a distance r in the Rj plane, so that the edge Jj” lies along the q;, axis (figure 16). 
We have now arrived at the following problem: find the solutions of the 

equation 
So(V1,¥2) = 0 (38) 


as class W curves of vector order P = (— 1,0) 
y, = bir *(1 + o(1)) , 
y2 = 6j(1+0(1)),  —b, #0,00. 


The truncated series for this vector order is Te = f(y). That is, we obtain the 
following equation for the coefficient b3: 


f5(bs) = 0. (39) 


Here, we are interested in the finite, non-zero roots of this equation. If b; = b,, 
is such a root, we must find solutions of equation (38) near the point y, = 0, 
Yq = b,,. As long as fo is expressed in terms of integral powers of y, and yz, this 
is precisely the problem with which we started out: find all solutions of an analytic 
equation in some neighborhood of a known solution. 

If b, = bj, is a simple root of (39) (ie. df;/dy, # 0), then, by theorem 1, 
all solutions of equation (38) near the point y, = 0, y. = b3, are given by a series 
b(y,) in positive integral powers of y,: y. = b3, + b(y,). This branch of the 
solutions of equation (38) is separated from other branches. Returning to x,, x2 
coordinates with the help of the inverse power transformation (16), we obtain 
expressions for x, and x, in the form of series in powers of y,. That is, the solution 
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of equation (2) will be represented in parametric form (24) with the parameter t 
equal to yj}. 

If b, = b3, is a multiple root of equation (39), then the point y, = 0, y. = b}, 
may be a critical point of the equation /$();,¥2). To examine this point, we 
make the translation z = y, — b3,, construct the Newton open polygon for the 
function 


G(¥152) = fo(Vi,¥2) 


select edges, etc. In other words, we must investigate the function g(y,,z) in 
exactly the same manner described here for the function f(x,, x). 

It remains to be shown that a finite number of such steps is sufficient to 
distinguish all branches of solutions of equation (2) from each other. For an edge 
T;', we call one less than the number of integral points on the edge the height, 
which we denote by 4;. The sum of the heights of all edges of f is the height of 
the open polygon If, denoted by 6 = 5(/). The number 6 characterizes the degree 
of the singularity at the origin. If f = 0 at the origin and df/éx, # 0, then 6 = 1 
and the origin is simple. In general, using the extreme vertices Q* and Q, of 
the Newton open polygon, it is easy to see that 


O<g3-du» OS 4y4- 4 - (40) 


We now note that the height 6, of an edge is equal to the height 6 of the entire 
open polygon only if the open polygon / consists solely of that edge; otherwise, 
6, < 6. Further, under the power transformation (15) with a unimodular matrix a, 
there is a one-to-one correspondence between the points of the integral lattices 
of the g;, q2 and q}, q, planes. Thus, the height of an edge is invariant under 
those transformations. 

If we divide fil) by the factor y3 of maximal degree (i.e. fo(v2) = y3h(y2) 
with h(0) ¥ 0), then the height of Jj‘ is equal to the degree of h(y2) (figures 16 
and 17). 

In order to investigate the neighborhood of the point y, = 0, y, = y$, we 
construct the Newton open polygon /(g(¥,, z)). According to (40), the polygon’s 
height does not exceed the coordinate q¥. In turn, the number gq is the multiplicity 
of the root b, = y$ of equation (39). But the multiplicity of this root y$ # 0 does 
not exceed the degree of the polynomial h(y,), which equals the height 6, of the 
edge J;. Thus, the height of /'(g(y,,z)) cannot exceed the height of any edge of 
the original open polygon, /(f(x,, x2)). Equality is possible only when the open 
polygon /(f) consists of a single edge and when the polynomial f;03) has only 
one root y$ # 0, the multiplicity of which is the degree of the polynomial h(y,). 
That is, in all other cases the procedure described above will result in a reduction 
in the height (and degree) of the singularity being investigated, and will reduce 
to the case of a singular point of unit height. This will be a simple point, 
corresponding to an isolated branch. Unit height cannot be reached in a finite 
number of steps, like those described above, only if equation (2) has multiple 
branches. For example, if 
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flee 


then the branch x, = )°@, x} will be a triple solution of equation (2), and by 
our method of calculation the height will always be 3. 

We note further that the number of branches of the solutions of equation (2) 
does not exceed the height 6 of the Newton open polygon /(f). The number 
of real branches (corresponding to real roots y, = y$ of equations like (39)) 
may be even less. 


co 
Ms 


3 
xt) g(X1,X2) ’ 


Example 5. (see example 3). f = x} + x3 — 2x,x2; the open polygon P is 
pictured in figure 17. For edge I”, the vector R, is (—2, 1). We seek integers s, 
and s, that satisfy equation (36): 


S$; $2 
= 


=s,+2s,=1. 


Here we can use s, = 1, s, = 0. Then variable change (15) is: 
Vi, V2 = X17X2 . (41) 
By (37) the inverse change of variables (16) is 
*1=)1 >» X2=Yiy2 - (42) 
Transformation (41) transforms our function f into 
L(V Ya) = S152) = VE + VEV2 — iy - 
Dividing by y}, we obtain 
So = viet = 1 —2y. + yiy2 ; 
which we truncate to 
F3(Y2) = fos ¥2) = 1 — 2y2 - 


This polynomial has a single, simple root y3 = 1/2; the corresponding branch is 
therefore isolated. Making the substitution 


we obtain 
foy4 + 2) = —22 + RG +23 = —2z2 + bp +°° 


A first approximation to this is 


—2z+4yi > 
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with a root z = (1/16)y}; thus, z = (1/16)y? + ---, where the dots denote terms 
which contain y, in powers greater than 3; hence 


yr =dt debt 
and, according to (42), 


= — 4,2 22 4;5: ane 
X=V; X2=27y1 + i6eyi + 


This branch of the solution curve has exactly the same form as in example 3. 
For the edge 7)", the transformations of both methods are the same. 

We now turn to edge [}, for which R, = (—1, 2), and we seek integers s, 
and s, that satisfy 
Sy Sy 


| 9( TA taal. 


We could use s, = 0, s, = 1, but, after transformation (15), there would be nega- 
tive powers of y, in f’. To avoid this, we choose s, = 1,s, = —1. Transformation 
(15) takes the form 

Vy =%1xz', Yo = x1'x} , 


and its inverse (16) is, according to (37), 


MS Vie es (SIS (43) 
Thus 
F’(V15 Y2) = £(%1,%2) 


= yiy3 + yiv3 — 2ytys = yiyi(vive + yo — 2). 


The truncation is ff = y3(y. — 2), from which we obtain the root y$ = 2. This is 
a simple root, so that the branch is isolated. We find the linear part of the 
expansion of y, in terms of y, from the equation 


9(V1,Y2) = Viv2 + Y2 -2=0. 
The substitution y, = 2 + z gives 
Q(y,,2+z)=z+2ypt+--, 


from which we obtain 
z= —2yd te 


Thus, returning to our original variables, we find 
y2=2-2yite, 
and, according to (43), 
xy =2yt—2ypitor, 
x2 = 2y, —2ypt-- 


Here, y, is the parameter t~'. The branches are shown in figure 19. 
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Fig. 23 


Example 6. f = x§ — x}x3 — x3 = 0. The open polygon / (figure 23) has two 
edges. For edge I”, 0, — Q, = (—2,2), so R, = (—1,1), and equation (36) is 
satisfied by s; = 1, s, = 0. We thus have the power transformation 


YiFX > V2 = X1'X2 
and its inverse, 
X=); X2=Viy2- 
Thus 
F's ¥2) = VBS — YiV2 — yt = VIS — V2 — DS 
the roots of the truncated equation y;5f’ = —y3 — 1 = O are imaginary, y9 = +i. 


They correspond to complex branches of the solution curve of the original 
equation. Since this truncated equation has no real roots, the edge I” cor- 
responds only to complex branches. 

For edge I$, we have Q, — Q2 = (—3,4) = R. Expanding y into a continued 
fraction, we obtain y = |r,/r| = 3/4 = (1 + 1/3)~'. Equation (36) is satisfied by 
Ss, = 1, s, = —1. This gives us the transformation 


= -1 _ y-3\4 
Vi =X1X%2 y2=%1 %2> 
and its inverse 
= yt Le 
Xx, =YViy2; X2=Yiy2 - 
We thus obtain 


Sf’ = yt? y$ — yi8y2 — yi°ys = yi yi(y2 — 1 — yt) . 


The truncation P 
vif’ = y20v2 — I) 


has only one non-zero root, y} = 1; this is a simple root, corresponding to 
a simple, isolated branch of the solutions of the original equation f = 0. After 
making the substitution y, = 1 + z, we find that z = y,. Unlike our previous 
cases, this is an exact and not an approximate solution of f’ =0. Thus, 
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Fig. 24 


yz = 14+ y?, and 
X=yityi, x=yityi- 


The disposition of this solution in the x,, x2 plane is illustrated in figure 24. 


Exercise 4. Do exercise 3 using the method of this section. 


2.10. The Method of Weierstrass 


Weierstrass [1902] outlined a method of solving equation (2) with the help 
of birational changes of coordinates. We call a transformation from x,, x, to 
Y1> 2 birational if the y, can be expressed as a ratio of polynomials in x, and x, 
and the x, can likewise be written as a ratio of polynomials in y, and y,. Birational 
transformations include parallel translations, affine transformations, power 
transformations with unimodular matrices, as well as all (finite) superpositions 
of any of these. In particular, the transformations used in the preceding section 
to isolate branches of solutions are birational. We will describe Weierstrass’ 
method from our point of view, which differs from [Weierstrass, 1902], as follows. 

Weierstrass showed that branches of an analytic curve can be found using 
a finite number of transformations of the form 


Vi X1%2 > y2 =X2 (44) 
or 
Wr* > V2 = XX2 (45) 


and parallel translations. In the preceding section we saw that a finite number of 
power transformations (15) with unimodular matrices « and parallel translations 
is sufficient to isolate all branches. We will show here that a power transformation 
with a unimodular matrix « can be expressed as a superposition of a finite 
number of transformations of the form of (44) or (45). These are, in fact, power 
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transformations with corresponding matrices 


1 1 1 0 
(( ) and ¢ ar (46) 


Thus, we need to show that the matrices a described earlier can be expressed 
as finite products of matrices of form (46). A method of finding the matrix a was 
presented in section 1.9, where we also showed that @ is a product of triangular 


matrices 
1 1 0 
% = ( a) and 4% = ( ) (47) 


with integers a,. It is easy to see that 


(0 iC i) mG a)-C 4). 


That is, every matrix of the form (47) is a product of matrices of the form (46). 
Thus, each power transformation of the previous section can be decomposed 
into a finite number of Weierstrass’ transformations (44) and (45). A succession 
of transformations of form (44) and (45) is used in algebraic geometry in the 
resolution of singularities of algebraic curves; this is called a “sigma-process” 
(see Walker, 1950). The resulting power transformation could be called a “multiple 
sigma-process”, since its use would require fewer calculations than the usual 
sigma process, with identical results. 

Bendixson [1898] applied the method of Weierstrass to the resolution of the 
singularities of an ordinary differential equation (see also Dulac, 1934). His 
results could also be obtained with the transformations of our “second method”, 
as we shall see in Chapter II. 


§3. Level Curves of an Analytic Function 


3.1. Statement of the Problem 


Let the function f(x,,x,) be defined in some region. The level curves of this 
function are the curves determined by the equation 


Sf (x,,x2) = ¢ = const. (1) 


Different values of the constant c correspond, of course, to different level curves. 
In §3 we will consider the following problem: let the function f(x,, x.) be analytic 
at the point x, = x9, x, = x9; in an arbitrarily small neighborhood @ of this 
point, find explicit expressions for the level curves (1) of /: 


xi = b(c, t) > i= 1, 2 ’ (2) 


(tis a parameter), and determine their relative positions. For the sake of visualiza- 
tion, we will consider here the real case when the function f and the coordinates 
x, are real; however, all of the methods presented can be applied equally well to 
complex situations. Without any loss of generality, we can also assume that 
x? = x9 =0 and f(x, x$) =0. This can be accomplished by making parallel 
translations in the coordinates x; and the constant c. Then the function f can be 
expanded in a Taylor series 


f= V MxX*®, = fo=0, (3) 
g>0 - 


which converges absolutely in some neighborhood of the point X = (x,,x,) = 0. 
We denote here and afterwards 


X=(X1,%2),  Q=(1.42), XP HxPxP. 


3.2. Coordinate Transformations 


We review here a few facts of analysis. Let there be a transformation 


x= C(Y), €(0)=0, f=1,2, (4) 
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which transforms the point Y = (y,,y.) =0 into the point X¥ =0. In some 
neighborhood of Y = 0, let the functions €; have continuous partial derivatives, 
and let the Jacobian 


de; 06; 
D(X1%9) Oy, Oy, 
D(yi,¥2) | 8k, Ge, 

dy, ay, 


be non-zero at Y =0. Then transformation (4) has the following properties: 
(see Courant, 1931, pt. IT, Ch. 3, § 3) 

1) The image of every sufficiently small neighborhood of the point Y = 0 
contains some neighborhood of the point X = 0. That is, for every ¢ in some 
interval (0, °), there exists a 6 > 0, such that the image of the neighborhood 


lv¥l<e, i= 1,2 (5) 
contains the neighborhood 
Ix1<6 , i=1,2. (6) 


2) Transformation (4) is invertible in some neighborhood of the point Y = 0. 
That is, there exists an inverse transformation to (4), 


y=n(X), n(0)=0, t=1,2, (7) 


which is defined on some neighborhood of the point X = 0. Transformation (7) 
also possesses property 1) (with X and Y exchanged). 

3) There exist a neighborhood Y of X = 0 and a neighborhood ¥ of Y = 0, 
which lie in the domains of definition and ranges of values of transformations 
(4) and (7), and between, which these transformations establish a one-to-one 
correspondence. 

Consequently, transformation (4) constitutes a one-to-one change of co- 
ordinates from X to Y in the region %. 

If the functions ¢; are analytic at Y = 0, then they can be expanded in power 
series in Y, convergent in some set of form (5). Then the inverse transformation 
(7) will also be analytic at X = 0, i.e., the functions y; can be expanded in Taylor 
series, convergent in some neighborhood of form (6) (see Goursat, 1933, v. 1, 
Ch. IX, §§ 181-188). Frequently a transformation of form (4) is given in Taylor 
series form. Then it is defined in the domain of the convergence of those series. 
We now return to the problem of level curves. 


Theorem 1. Let the point X = 0 be a simple point of the analytic function f(X); 
ie, at the origin 

of 

Ox, 


f 


+i——| #0. (8) 
OX 
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Then there exists an invertible, analytic coordinate transformation (4) such that 
S(X)= ye. (9) 


Proof: Because of condition (8), we know that one of the partial derivatives 
Of/dx, and f/x. must be non-zero at X = 0. Assume that df/dx, 4 0. We make 
the coordinate transformation 


yrTX% > y2 = f(X) . (10) 


At X = 0, the Jacobian of this transformation is non-zero, since 


0 
D(x)\" oy; of 
—EE = —— = = —_—_ 0 e 
Gy, cer ( f of) ~ dx,” 
é 
Obviously, transformation (10) includes expression (9). 1 


The point of this theorem is that, with the help of a local change of variables, 
the function f can be written in a simple form which makes its level curves much 
easier to find. Indeed, the level curves of the function in (9) are, in Y coordinates, 
just the horizontal lines y, = t, y. = c. Near the origin of the original coordinate 
system X, this picture changes a little, and the level curves are no longer straight 
lines, but as before they will not cross themselves. They will be expressed by the 
formulas 

x; = ¢i(t,c) > i=1,2, 


where |t| < ¢, |c| < ¢. In fact, the same result is obtained by solving the equation 
S(X1,X2) —¢ = 0 with x, = &(x,, 0). 


3.3. A Nondegenerate Critical Point 


In what follows, we will consider only a neighborhood of the critical point 
X = 0, when expansion (3) contains no linear terms. If the discriminant of the 
quadratic terms, d = fi7,4) — 4f(2,0)f0,2). 18 non-zero, then the critical point 
X = 0 is nondegenerate; if d = 0, the point is degenerate. 

As we know from elementary algebra, every quadratic form 


Soot + Sir %1%2 + fo2xd #0 


can, with the help of a non-singular linear transformation X = BZ, be transformed 
into a sum of quadratic terms 


a(z? + 023) , a#0, 


where the number a is zero or +1. For a non-degenerate critical point, of course, 


3.3. A Nondegenerate Critical Point 67 


o #0. The expansion of f in terms of Z takes the form 


f=azr+aez3+ Yo foZ?. (11) 


aita2z3 


Theorem 2. Let X = 0 be a non-degenerate critical point for a function f. 
Then there exists an invertible analytic transformation (4) which reduces the 
function f to the form 

f =a(yj + oy2) . (12) 


Proof. Let the function f be reduced to form (11). As noted, a 4 Oand o # 0. 
We divide the last sum in (11) into three parts, 
h%(Z) = YY foZ? , 


92<2 


hY(Z) = Y foZ? , 


ai<2 


A(Z)= Yo foZ?. 
91-9222 
Thus, 
AY FAVLAM = YL foZ? ‘ 
41+49223 

In figure 25, the different shaded regions of the q,, q2 plane contain the supports 
of the different series h®. Let g(Z) be an arbitrary convergent series in Z, the 
support D(g) of which satisfies the inequality q,, q, > 2; .e., it lies in the doubly 
hatched region in figure 25. We will show that the system of equations 


azj + h(Z) + g(Z) = ay? , 


(13) 
aoz} + h?(Z) + h®@(Z) — g(Z) = acy3 
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has an analytic solution. In fact, the first equation immediately gives 


y, = 21/1 +a'277(h +g) . (14) 


But according to our construction, the ratio (h") + g)/z? is analytic at the origin 
and vanishes there. Therefore, the square root in (14) can be expanded in powers 
of that ratio and, consequently, can be written as a convergent series in powers of 
Z. It can similarly be shown that the solution y, = y,(Z) of the second equation 
in (13) is also analytic. [1] 


Note that this theorem allows for considerable arbitrariness in constructing 
the transformation that leads to (12) since the choice of g(Z) is arbitrary. For 
example, we could choose g(Z) = 0 or g(Z) = h. Moreover, there is even greater 
arbitrariness in the choice of the transformation of theorem 2. 

We now investigate the level curves of function (12) in Y coordinates. If o > 0, 
the level curves are the circles 


y? + y3 =a™‘c = const 


(figure 26); f has an extremum at the origin. This last equation can be written 


J2 


lan 
NS 


Fig. 26 


J2 


Fig. 27 
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as a pair of parametric equatons 
y, =/e/a cost , y2 = /c/asint , ac>0, —o<t<+0. 


Ifo < 0, then the level curves are the hyperbolas, y? — y? = c/a = const (figure 27); 
f has a saddle point at the origin. The curves are then defined in terms of the 
hyperbolic functions: 


ac>0, y= +./c/a cosht, y= Jc/a sinht ; 


c=0, yYo= ty; -—o<tT< +0. 


ac<0, y, =/—c/asinht , yy = +./—c/acosht . 


Here, the parametric equations for the level curves are different in different parts 
of the neighborhood of Y = 0. We obtain an explicit representation (2) of the 
level curves by substituting into (4) the expressions for y; in terms of c and t. 

Theorem 2 tells us that in the class of analytic changes of coordinates a 
function f can be set equal to its own quadratic form if the discriminant of the 
quadratic form is non-zero. 


Example 1. Let us find the level curves of the function 
f=xit+x3-—2x,x,. 


The discriminant is d = 4 #0. According to theorem 2, f = —2y,y,. Con- 
sequently, the point x; = x, = 0isa saddle point of f. The branches of the folium 
of Descartes pass through this point (see example 5 of section 2.9). 


3.4. Sets of Convergence of Powers Series of Class 7 


In order to prepare a means for studying the neighborhood of a degenerate 
critical point, we consider a few properties of power series in two variables. 
Following Goursat [1933, §§ 161-164], we will consider the series 


Figg. (15) 


QeZ? 


where the indices Q = (q,,q ) run over the two-dimensional integral lattice Z7. 
Let these indices Q be represented as points of the R? plane. Let us imagine that 
there is an infinitely expanding sequence of regions in this plane 


Q, © Q,0 Qo COC, 


which tends to R? as n > oo. Let 


Ss, = b ag 
QeX2,, 
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be the sum of those terms of series (15) whose indices lie within the region @,. 
If the partial sum s, approaches some limit s as n increases without bound, and 
if that limit does not depend on the choice of the sequence of regions {Q,}, then 
the series (15) converges to the sum s. 

If, for some sequence of regions {@,}, the partial sums of the series 


Dlagl . (16) 


approach some limit, then the series (15) is absolutely convergent. In this case 
the sums of series (15) and series (16) do not depend on the choice of the sequence 
of regions {Q,}. 

Everything we have asserted applies as well to the summation of a multiple 


power series of the form 
h=¥ hgX® , (17) 


where Q = (q;,q2), X2 = x7'x%, and Q € Z?. Here, however, a question arises: 
for what values of X does such a series converge? 

Let V be a closed cone in the R? plane, the boundary rays of which are 
spanned by the vectors R, and R* (figure 28). Let Y denote the class of power 
series of form (17) whose supports lie in the cone V: 


Dih)cV. (17’) 


We assume at first that the coefficients hg of such a series are non-negative, and 
that the variables x, and x, also take on non-negative values. Let two points, X 
and X, satisfy the inequalities 


Kee RRs Re RRP (18) 
Since any Q € V can be expressed as 
Q=aR, + BR* , 
where a, 8 > 0, the inequalities (18) lead to the result 
XP = (XRI(XE YP < (KBE YS = XO (19) 


Clearly, if series (17) converges at a point X, it must likewise converge at any 
point X which satisfies the inequalities in (18); in fact, 

h(X) < h(X) . (20) 
Conversely, if the series diverges at a point X, it must diverge at any point X 
which satisfies (18). 


In order to analyze the construction of sets of points satisfying (18), we look 
at the logarithm of the coordinates X: 


InX = (Inx,,Inx2) . 


Then the inequalities of (18) take the form 
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<R,y,InX> <<Ry In X) , <R*,In X> < <R*,InX) . (21) 


Note that the transformation 
P=1InX 


maps the interior of the first quadrant of the X plane onto the P plane. The 
mapping is monotonic and one-to-one. This transformation maps the point 
X,; =X, = 1 to the origin, p, = p, =0; the positive half of the x, axis is 
transformed into the “infinitely distant line” p, = —oo; the point x, = x, =0 
becomes the “infinitely distant point” py, = pz = —o, and so forth. 

Let P be some point in the R3 plane. We pass through it two lines which are 
perpendicular to the vectors R, and R*: 


K, = {P: (Ry, P> = (R,, PY} , 
K* = {P: (R*, P) = (R*,P>} . GP) 
These lines divide the plane into four parts (figure 29). The part labelled III is 
the normal cone U of the set V, displaced parallel to the vector P. If P = In X, 
then the inequalities in (21) will be satisfied by those P = In X which lie in part I. 
If P = In X, the inequalities (21) will be satisfied by the points P = In X lying 
in part [II of the plane. Returning to the X plane, we obtain a corresponding 
division of the first quadrant into four parts (figure 30). If ¥ = X, then the points 
X from part I satisfy the inequalities (18); if X = X, the points X from part 
III satisfy (18). 


Example 2. R,, = (1,0), R* = (0, 1); then (17) is an ordinary power series. The 
cone V is the first quadrant, and U is the third quadrant. The inequalities of (18) 
take the form |x,| < |x;|, i = 1, 2. 


Example 3. R, = (2, —1), R* =(—1, 1) (figure 28). For P =(-1, —1), the 
division of the R3 plane by the lines (21’) is illustrated in figure 29. The corre- 
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Fig. 29 


Fig. 30 


sponding division of the first quadrant of the x,, x. plane is made by the curves 
x, =x? and x, = x, (figure 30). 


Exercise 1. Sketch the cones V and the divisions of the X and P planes 
corresponding to 
R, = (2, 1), (2, 1), (— 1, —2) ; 


R* = (1, 2), (— 1, 2), (—2, -1) ; 
P = (1,0), (—1, 1), (0, 1), (1, 1) . 


Now let us consider a line L in the R3 plane, the normal to which does not 
intersect the cone V. If we construct a division of the R3 plane corresponding 
toa point P ¢ L, then the line L will lie in the resulting parts I and III. We denote 
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by L(P) the half-line lying in part IIT, and by L(P) the half-line lying in part I. 
The point P is not included in those half-lines. If series (17) converges at the point 
P = 1n.X, then it converges for all In X lying on the half-line L(P); likewise, if the 
series diverges at the point P = In X, then it diverges for all In X € L(P). Con- 
sequently, there exists some point M = In X¥ which lies on the line L, such that 
series (17) diverges for all In X € L(M) and converges for all In X € L( M). For 
in X = M, series (17) may either converge or diverge. An extreme case is possible: 
M may be one of the “infinite endpoints” of L; then the series either converges 
for all X, or diverges for all X not on the coordinate axes. 

But let us return to the case in which the point M is found in some finite part 
of the plane. Consider lines parallel to the line L in the R? plane. On each of 
these lines, we can find a point analogous to M. The collection of such points 
forms a continuous curve in the R3 plane. In fact, let L’ be a second line parallel 
to L, and let M’ e L’ divide the line L’ into sets of convergence and divergence 
of series (17). Then M’ lies in either the second or fourth parts of the division of 
the plane corresponding to the point P = M;i.e., between the points L and N in 
figure 29. As the line L’ approaches L, this interval NL converges to the point 
M = P. Hence the curve @M, which bounds M, the region of convergence of 
the series, must be continuous. 

Thus, series (17) converges for every point In X = P which lies in the region 
M. At any point outside of M and 0M, the series diverges, while at any points 
of OM it may either converge or diverge. 

Depending on series (17), the boundary curve QM may have a variety of 
forms. If we construct a division of the plane into four parts (as described above) 
at any point M € 0M, the curve 0M will lie in the second and fourth parts. 
Representing the curve 0M in the coordinates x, = expp, and x. = expp, 
we will obtain a curve 0.4, in the first quadrant of the X plane which bounds 
the region of convergence, ./%,, there. This curve will lie in parts IT and IV of the 
division of the plane corresponding to any one of its points. 

We now consider in class ¥ a series (17) with arbitrary numerical coefficients 
hg. We let 


|h| = |hgl |X/2 . 


For this series, we construct the curve @.4,, which bounds the region %, of 
convergence of the series in the first quadrant of the X plane. Series (17) converges 
absolutely at a point X if that point lies within the set .# which is bounded by 
the four curves 0.4%, 0.4%, 0%, and 0.@,, where the latter three are symmetric 
to the first with respect to the coordinate axes. We can also show, as does 
Goursat [1933, § 182], that the series is not absolutely convergent at any point 
X which lies outside of 4 and away from the coordinate axes. The set .# in 
the X plane may consist of one, two, or four simply connected pieces. If it is not 
empty, it contains a set 


IX <a, |X| <b. (22) 
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x2 


0 ms 


Fig. 31 


The set is illustrated for the situation in example 3 as the shaded region in 
figure 31. We will call a series (17) of class ¥ convergent if its region of absolute 
convergence contains a set of form (22); otherwise, we shall call it divergent. 


3.5. Properties of Class ¥ Series 


We consider a convergent series (17) of class ¥ (ie., having property (17’)) in 
its region .# of absolute convergence. 


Theorem 3. The sum h of a series (17) of class V is an analytic function at 
all interior points of the region , with the possible exception of the coordinate 
axes. 


Proof. If the cone V lies entirely in the first quadrant of the R? plane, then 
the sum h of series (17) is analytic at the point X = 0 and at all interior points 
of convergence (see Goursat, 1933). If V does not lie in the first quadrant then, 
according to lemma 1 below, there exists a unimodular matrix « such that the 
transformation 

InZ=alnX (23) 


transforms series (17) into 
B=) WgZ* x (OS a8), (24) 


a series whose support D’ = «*~'D lies in the first quadrant of the q}, q3 plane. 

In the R3 plane, with coordinates In|z,|, In|z,|, the convergence set M’ 
of series (24) is related to the convergence set M of series (17) by the affine 
transformation: M’ = aM. But the sum h’ must be analytic in the set M’. Trans- 
formation (23) is analytic everywhere except, perhaps, on the coordinate axes. 
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Therefore the sum of series (17) will be analytic everywhere within the set of 
convergence .%, with the possible exception of the coordinate axes. 0] 


Lemma 1. For every convex cone V < R? which does not contain a half-plane, 
there exists a unimodular matrix « such that the cone a~'*V lies in the first 
quadrant. 


Proof. Let the boundary ray R, = {cR,,} of the cone V pass through integral 
points. Let Q, = (414,924) be the point nearest to the origin. Let o = 
sgn(q24!* — 414rF); we seek a pair of integers 9,, 4, which solves the equation 


92491 — W1492 =F - 


If the point O = (q,,42) lies inside the cone V, then we choose a positive integer k 
such that the point Q* = (q*,q*) = 0 — kQ,, lies outside of V. This is possible 


since the cone V does not contain a half-plane. The cone V, spanned by the 
vectors Q, and Q*, contains the cone V. The matrix 


a~ (te ) 
qt 4 


is unimodular, and the transformation 
a*Q'=Q 


maps the cone V onto the first quadrant; the vectors Q, and Q* are mapped onto 
the unit basis vectors. The cone V’ = «*~' V lies entirely within the cone V’, hence 
is in the first quadrant. 

If neither of the boundary rays of the cone V is rational (ie., passes through 
an integral point), then, instead of V, we must use a larger cone W, which is 
bounded by a rational ray but which does not contain a half-plane. This is 
possible because the rational numbers are everywhere dense among the reals. 
The rest of the construction proceeds as above. 1] 


It is possible to show, as Goursat did, that series (17) is termwise differentiable 
within its region of convergence. The region of convergence of the partial derivative 
of the series coincides with that of the series itself; the operations of summation 
and differentiation commute in that region. 

Thus, in a region of convergence series of class 7” have the same properties 
as Taylor series of analytic functions; one can perform arithmetic operations and 
differentiation on them, and substitute one series into another. 

If the cone V does not lie in the first quadrant, then the point X = 0 is not 
included in the convergence set .@ of series (17). But if X approaches the origin 
within .@, then the sum h approaches a limit hy. As in the proof of Theorem 3, 
this can be shown using transformation (23) and Lemma 1. 
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3.6. Transformations of Class 7 * 


We denote by ¥ * the class of transformations of the form 
y=x(l+9(X), gEeKv, g()=0, i=1,2. (25) 


Such a transformation is defined on the intersection of the sets of convergence 
of series g, and g, and contains a set of form (22) (see section 3.4). 
Let X range over set (22). We will show that the range of values of Y defined 
by (25) contains a set 
ly[F#<a’, {YF <b’. (26) 


Indeed, let % be a set of form (22), on which |g;| < 1/2, i = 1, 2. On this set 
according to (25), 
3|xl<lyl<3ixd, i=12. 
That is, 
slyl<Ixl<2ly1, i=1,2, 
and 
wl VIR < [XI < yy [YI , 


ey <(X Peery 


where y,, u*, v, and v* are positive numbers, products of powers of 2 and of 2/3. 
Hence, the image of set (22) will contain set (26), where a’ = via and b’ = b/v*. 


Theorem 4, If transformation (25) belongs to the class ¥ *(X) and converges, 
then there exists a unique inverse transformation 


x = y(1 + h(Y)) , i=1,2, (27) 
which also belongs to the class V* and converges. 


Proof. The proof naturally divides itself into two parts. First we prove the 
existence of the formal series 


h=YhoY¥2eV, i=1,2, (28) 
which satisfy the equations 
yi = (1 +h) + gor + Viki. ¥2 + Y2h2)) , i= 1,2. (29) 


We will then show that the series converge. 

To start out, we assume that the cone V lies in the first quadrant. We will 
solve equation (25) by the method of undetermined coefficients. We know that 
in the expansions 

9: = >. 9i0X° > i=1,2 (30) 


Jio = 0 and, if Q is not in V, gig = 0. We need to show that there exist solutions 
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(28) of the system of equations (29) such that h,;, = 0 and hjg = 0 if Q is not 
in V. We will prove this by induction with respect to the degree of the terms, 
Oll = 41 + 42. Let the result be true for all Q: q, + q2 <n; we will show it is 
true for gq; + q, =n. Using expression (30), equation (29) can be written as 


L=1+h+VgeY7l+ayrd+hyrrlth), §=1,2. (BI) 


Thus, the coefficient of Y2(||Q|| =n) in the right-hand side of the i‘® equation 
has the form 


hig + Y Giphip, ---hiphas, ---has, =hig + Cig » (32) 
where 
P+PL tot PR tS, +°°4+5,=Q. 


Here, ||P;| < |lQ\|, IIS;ll < ll. If Q € V, we let hig = —cjg and equation (31) is 
satisfied “for the Y? term”. If Q does not lie in the cone, then at least one of 
the indices P, P,, or S, in (32) lies outside of V, since any sum of vectors in V 
must itself lie in V. Then, from our inductive assumption and by the properties 
of the series g;, we see that the corresponding coefficient g;p, h,p,, or hys, must 
vanish. This causes all the terms of the sum cjg in (32) to vanish, so that cig = 0; 
equation (27) is then satisfied by setting h;g = 0. Thus property (28) is proven. 

The convergence and uniqueness of the formal series h,(Y) follow from a 
general theorem on analytic implicit functions (see Goursat, 1933, part 2, § 185). 

We now drop the assumption that the cone V lies in the first quadrant and 
consider the general case. We make the power transformation 


InZ=alnX , 
(33) 
InW=alnyY, 
where the matrix a is as in Lemma 1. Then transformation (25) becomes 
w=2z(L+g9(Z)), i=1,2, (34) 


where D(g;) < V’ = a '!*V. According to Lemma 1, V’ lies in the first quadrant, 
and we have already proven the theorem for that case. There exists an inverse 
transformation to (34), 


z,=w(l + hy(W)) , i=1,2. 
We now use (33) to return to X and Y coordinates from Z and W coordinates. 1] 
Remark. If the cone V does not lie in the first quadrant, it is possible to invert 
transformation (25) without a power transformation. It is sufficient to partially 


order the indices Q: a vector S precedes a vector Q if <S, T> > <Q, T>, where 
T is an arbitrary vector in U, the normal cone to V. 
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The convergent transformation (25) of the class ¥ * is defined on some set 
(22). According to theorem 4, the inverse transformation (27) is also convergent 
and of class ¥ *. As shown just before theorem 4, the domain of definition of 
the transformation contains a set 


[X|®®¥<a’', |X| <b’. 


Let a” = min[a,a’], b” = min[b, b’]. Then the original transformation (25) and 
its inverse (27) are defined and single-valued on the set 


[X|Re <a”, [X|R" <b” (35) 


and their composition is the identity mapping of that set. Hence, either trans- 
formation can be considered as a change of coordinates in region (35). 


3.7. The Vertex Case 


We return to the problem of constructing the level curves of an analytic 
function 


f=Tf[oX®, fo=0. 0<0€7 (36) 


in a neighborhood of the degenerate critical point X = 0. Let us consider the 
Newton polygon I(f). Let 5 = 0 = (4,,42) be a vertex where the edges Jj") 
and J) meet. Let R; = (r,;,72;) be a unit vector along J” (ie, it is the difference 
between neighboring integral points) and let P, = (—r,;,r,;) be a vector normal 
to J;"? directed out of the polygon I’. We introduce the vectors R, = —R,j-1, 
P, = —P,_,, R* = R;, P* = P,. Then the normal cone U{ to the vertex J; is 
defined by the inequalities (R,,P> <0, <R*,P> <0, and is bounded by rays 
directed along P, and P* (figures 28 and 32). We will consider the convex cone 
V in the R? plane defined by V = {Q: (Q, P,> < 0, <Q, P*> < 0}, with its vertex 
at the origin and its bounding rays parallel to the edges /;°) and J; (figure 28). 


Theorem 5. There exists a convergent transformation (25) of class ¥*, which 
reduces function (36) to the form 


Fig. 32 
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f=feye. (37) 


Proof. So long as O # 0, one of the coordinates g, or GJ is non-zero. For 
convenience, assume g, # 0. Then the transformation 


v1 =x VFeX% . ya =H (38) 
reduces function (36) to form (37). The quotient Sf fgX°®) = 1 + g(X) is a con- 


vergent series of class ¥, with its constant term equal to 1. Therefore the root 


41/1 + g(X) can be written as a convergent series in powers of g(X), which 
yields a class VY" series. Consequently, transformation (38) is of class ¥* and is 
convergent. 1] 


Corollary. For some ¢ > 0, there exists in the region 
UO", X) = {X:|X|®* <6, |X|F < 6} 
a change of coordinates which reduces function (36) to normal form (37). 


It is sufficient to examine the level curves of the normal form (37) inside some 
region UM(E, Y). We write a = fg and examine a number of different cases. 
1) 2 = 0. We can assume that R, = (1,0). The level curves ay?! = c are 
vertical lines: 
_§ ela, if, is odd, |c| <e ; 


Vi= ee 
; law/cja, if g, is even, 0 < c/a , 


The dashed curves in figure 33 represent the boundaries of the set Y{(e, Y), while 
the vertical lines are the level curves. Note that the number c changes sign in 
U(e, Y) only if g; is odd. 

2) O > 0. Here, the level curves ay$' y#2 = c are hyperbolas of the form 


y= a|c/a| 41 7~42/% is 


Yo= xt, 0<t<m, 


Fig. 33 
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where o takes on the values +1 depending on the sign of a and c and the parity 
of q, and q,. If c = 0, the level curves are the coordinate axes (figure 34). If 
R, = (1,0), then the set #/(¢, Y) is just as in figure 33. It contains one line, 
y2 = 0, which passes through the origin. The other level curves go from one 
boundary of the set %{(e, Y) to another. 

If the vector R, lies in the fourth quadrant then the set &(e, Y) consists 
of four parts, each traversed by level curves (figure 34). 

Using transformation (37), inverse to (38), we obtain explicit expressions (2) 
for the level curves in the set &(e, X). 

Thus, with each vertex J; of the Newton open polygon If) we can 
associate a set %{(e, X), within which the function f reduces to a normal form 
and is easily studied. In the R3 plane there is a corresponding set 


U(e) = {P: <P,R,> < Ing, <P, R*> < Ine} , 


which results from the translation of the normal cone Uo along the vector 
(In ¢, In €). 


Exercise 2. Construct the set %{(e) for the function of example 1, taking 
e = 1/10. 


3.8. An Edge of the Newton Open Polygon 


Now consider J; and J;£}, two vertices of the Newton open polygon [(f) 
(see section 2.9). We will show that the points of the set W{9\(c) are placed above 
those of the set %{(e) in the first quadrant of the x,, x, plane. In fact, let R = R; 
be the unit vector on the edge J}. Then the upper boundary of the set %{(e) 
is the curve 

xUxPae, Le x, =e2x] , 
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while the lower boundary of the set Z{?\(e) is given by 
xp xy2 =e, te. x,=e Vx? . 


Since r, > Oand« < 1, then "2 < ¢~"2, and the second curve is above the first. 
Consequently, between curvilinear sectors of the sets %{(e) and %?\(e) there are 
other curvilinear sectors of the plane which adjoin the origin. We will associate 


these sectors with the edge J; and denote their union by %{"(e). Thus, 
UM (e) = {Xie < [xq |xQ! < et [x4] <e|x2| <a} . (39) 


This brings us to a new problem: find the level curves of f in these sets (39) 
corresponding to the edges of the Newton open polygon /(f). To solve this 
problem, we will employ the transformations used in the “second method” of 
resolution of a singularity (see section 2.9). We note further that, in the R3 plane 
with coordinates p,; = In|x,|, set (39) becomes a half-strip parallel to the vector P. 
We make a change of coordinates 


Vy =XUXP , Vp SXYUXP, (40) 
where r, and r, are as above, and s, and s, are integers chosen to satisfy 
$4%_— Sor, = 1. (41) 


For any relatively prime numbers r, and r, there is an infinite number of pairs 
of integers s, and s, which satisfy (41) (see section 1.9). 
In agreement with the formulas for the power transformation of section 2.6, 
we have 
XUXP = pyrdt Tada yy sai t5142 = yhiyP " 


that is, the exponent of y, is 
qi = —Q,P)> = —- QP , 


where P, = (—r,,r,) is a vector orthogonal to the edge J, and P’ = aP, = 
(— 1, 0). Therefore, for all Q’ € D’ we willhave q, > r = —max<Q, P,>. Moreover, 
q, =r for those terms foX? which appear in the truncated series f;")(X); ie., 


EUX) = FO) = vi Solr) - (42) 
For all other terms in the expansion 
f= Dir 


q, > r. Geometrically, this means that under transformation (40), the edge J; 
becomes a vertical edge /;"”, and all the remaining points of I’ are to the right 
of this edge. That is, 
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I(X) = FY) = Vi fol(¥1+¥2) » (43) 


where r > 0 and f5 contains only non-negative powers of y,. 
We now note that after transformation (40), set (39) takes the form 


Mi" (e) = {¥:e< yl <elyil<e} . (44) 


Thus, after transformation (40), our problem has become: find the level curves of 
function (43) in region (44). This region is the neighborhood of that part of the 
yz axis which contains neither the origin nor the “infinite” ends of the y,-axis. 
Now let y$ be some point of set (44) which lies on the y, axis. We will try to 
find the level curves of f in the neighborhood 


W(e.y3) = {Y:lyil <a lye — vol <e} . (45) 


To do so, we make the parallel translation y, = y$ + z, and consider the function 
of y, and z, 
F(X) = Yi fo(V1,¥3 + 22) 


for three different cases. 
1) fo(y$) = a # 0. Then the transformation 


zy =) .A/foe ’ 22 = 22 


is analytic and invertible in some neighborhood of the point y, = 0, y. = y$, 
and transforms the function f to the form f = az{. The level curves of this 
function are parallel to the z, axis (see case 1 at the end of the previous section). 

2) y$ is a simple root of the equation fly) = 0; ie., df} /dy, =a#0at 
y, = y§. Then the transformation w, = y,, Ww, = f/a = 22 + °°: is analytic and 
invertible in some neighborhood of the point y, = 0, y. = y9; it reduces the 
function f to the form 

f =awj{w, . 


The level curves of this function consist of the horizontal line w, = 0 and the 
hyperbolas w, = cw," (see case 2 in the previous section). 

3) y$ is a multiple root of the equation fo(¥a) = 0. Then the point y, = 0, 
yz = y§ is a degenerate critical point of f’(Y). In order to study the level curves 
of this function in the corresponding neighborhood (45), we must construct 
the Newton open polygon for the function g(y,,2.) = f’(¥1,¥2 + 22), divide 
neighborhood (45) into sets W; “(e), and so forth. 

The union of the sets (45) covers the set (44). Let us take some finite covering 
Wey), ..., We, yS”). It contains neighborhoods of all the roots of the equation 
fo(¥2) = Oand a number of neighborhoods of simple points of the function f9(Y). 
Set (44) is a horizontal half-strip in the R35 plane with coordinates p, = In|y,|, 
Pz = In|y,|. The sets (45) correspond to similar horizontal, but narrower half- 
strips in that plane. Altogether, they overlap the half-strip of set (44) twice—once 
for y, > O and once for y, < 0. Set (44) is covered by a finite number of the sets 
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We, y$), in which the problem is either immediately solvable, or else can 
be solved by continuing to reduce the order of the singularity. As shown in 
section 2.9, the “second method” can completely reduce the singularity in a finite 
number of steps; i.e., a finite number of steps will bring us to case 1), to case 2), 
or to case 3’) when f has multiple factors: f = z}(z, — b(z,))*g(Z), where b(z,) 
and g(Z) are analytic functions at Z = 0, b(0) = 0, and g(0) = a 4 0. Then the 


transformation 
w, = V/g(Z)/a , W, = 22 — B(z,) 
is analytic at the origin and reduces the function to the normal form 
f=awiws . 


The level curves are coordinate axes and hyperbolas (see case 2 at the end of 
the previous section). 


3.9. Synthesis and Examples 


Thus, a neighborhood % of the critical point X = 0 is covered by a finite 


number of sets 
Ue), — WOO)... (46) 


within each of which there exists an invertible change of coordinates X ~ W 
which transforms the function f into a simpler, “normal form”: 


= q q 
f=awPwP . 


Using this normal form of f, it is easy to find the level curves of f in each of the 
sets (46) and write expressions for them in the W coordinates: 


w; = w,(t,c) , b= 152: 5 


Returning from W coordinates to X, we can obtain an expression of form (2) for 
parts of the level curves of f in each of the sets (46). It remains to “sew them 
together”, and we will have found the level curves of f in the entire neighborhood 
U. A difficulty arises here, however: the transformation from X to W coordinates 
is accomplished in two steps. First, the singularity is reduced by applying a finite 
number of birational transformations, each of which is the result of a finite number 
of power transformations and parallel translations. Then f is transformed into 
normal form by a final coordinate change. However, this normalizing trans- 
formation (and its inverse) is given as an infinite series. In a real situation, 
therefore, it is only possible to calculate the transformations approximately 
(up to terms of some finite degree). As a result, the expressions X = X(z,c) for 
parts of the level curves in each of the sets (46) are all approximate. To sew 
together solutions in neighboring sectors of the sets (46) it is necessary to obtain 
approximate solutions close to the true level curves. In different cases, different 
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amounts of calculations are required. For example, to get a topological picture 
of the level curves, it is generally not necessary to make the normalizing 
transformation—it is only necessary to resolve the singularity. However, if class 
C* accuracy is required (i.e., up to derivatives of k'" order), then it is necessary 
to calculate the normalizing transformation up to terms of some finite (though 
unknown) degree. In the examples below, we will limit ourselves to finding the 
topological character of the level curves (to avoid getting bogged down in 
calculations). In order to sew together the parts of the level curves in neighboring 
sectors of the sets (46) as we try to get this topological picture, we need to know 
how the level curves behave as we approach the boundaries of the sectors. 
Together with the behavior of the level curves in the interiors, this will give us 
a full topological picture in the case when one level curve adjoins a critical point 
inside one of the sets (46). If there is no such level curve, then each level curve 
goes from one sector to another and doesn’t remain entirely in anyone region. 
From the single-valuedness of the function f, we know that the level curves must 
be closed curves (like circles). 


Example 4. We will find the behavior of the level curves of the function 


f = 2x3 — xix, — 2xyx3 4x34 xf + x3 


Fig. 36 
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in a neighborhood of the degenerate critical point x, = x, = 0. The Newton 
open polygon consists of two vertices, 1° = (3,0) and /{° = (0, 3), and one edge 
with unit vector R = (—1, 1) (figure 35). Figure 36 shows the normal cones 


UP = {P: p, <0, pp — py < 0}, 
UP = {P: pz <0, Pp: — p2 <9} , 
UW = {P: p, =p, <0}. 
Let us consider the vertex 1. The corresponding set %(°(e) in the X plane is 
Ue) = {X:|x1| <8, |x, |" |x] < e}. 


In coordinates p; = In|x;,|; this set is pictured in figure 37, and in x,, x. coor- 
dinates in figure 38. According to theorem 5, there exists a coordinate change 
X — Y in this set %{(e) such that f = 2y}; and the coordinate change is close 


a 


Yy 
in 
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to the identity near the origin. Consequently, the level curves of f in the set 
Mc) are essentially vertical lines (figure 38). 
Similarly, vertex I{° corresponds to the set 


US (6) = {X: [xy] [2174 < e,|x2|< e} 


(figures 37 and 38). By theorem 5, there is a change of coordinates X — Y such 
that f = y3. Thus, the level curves of f in %{°) are essentially horizontal lines 
(figure 38). 

The edge 7") corresponds to the set 


US "e) = {X26 < [xy [x21 < e 1, [xl < ef (47) 


(figures 37 and 38). In order to investigate the behavior of f in this set, we make 
the power transformation 


VFX; X, =); 
V2 = X2X% X2 = Vi 2 
Then 
f= yil2— yo — 2y3 + ¥2 + Yi + 1 Y2) - 
Here, 


US (e) = {¥: ly] Se <ly2l<e7} . (48) 


This set consists of the two sets in the Vis V2 plane, bounded by dashed lines 
in figure 39. Here f’ = y3fj, where fj = 2 — y, — 2y3 + y3. The equation 
fi(y2) = 0 has three roots, all of them simple: y, = 1, 2, — 1. The corresponding 
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xy 


ly 
\ 


Fig. 41 


Gs) 


three points on the y, axis are saddle points of f; the other points of that axis 
are simple points of /5. The behavior of the level curves in set (48) is shown 
in figure 39. Now we could divide set (48) into parts, each part containing no 
more than one saddle point, and transform f into normal form in each part; but 
we do not do this. Returning to X coordinates, we find that in set (47), the level 
curves of f are as illustrated in figure 40. Combining this figure with figure 38, 
which shows the level curves in the sets W(°(c) and #(e), we find that in the 
entire neighborhood of the origin, the level curves of f behave as illustrated 
in figure 41. 


Example 5. We shall find the behavior of the level curves of the function 
f=x}—xp4xt4 x3. 


Here, the open polygon /, the cones @{*), and the sets @{(e) are exactly as in 
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the preceding example; the level curves behave similarly in the sets Wc) and 
Ue). The difference is in the behavior of the curves in the set (48). There, 
fi = y} — 1. The equation fg =0 has three roots, one real (y, = 1) and two 
complex. All are simple roots. The behavior of the level curves in set (48) is shown 
in figure 42. Figure 43 shows the division of the neighborhood of X = 0 into 
eight pieces and the linear approximations of parts of the level curves in each 
piece. We need only sew together these segments into smooth curves; the result 
is shown in figure 44. 


Exercise 3. Find the behavior (near X = 0) of the level curves of the functions 
J in the exercises and examples of § 2. 
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x2 
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Hs 


Fig. 44 


Remark. In fact, the topological type of the level curves depends entirely on 
the behavior of the curve f = 0. Only branches of this curve pass through the 
point x, = x2 = 0; between two neighbouring branches is a so-called “hyperbolic 
domain”. However, the method presented here allows us to get more complete 
information about the level curves. For example, the behavior of the level curves 
in figure 44 is topologically equivalent to that in the neighborhood of a simple 
point, but there exists no smooth equivalence between these sets of level curves. 
At the same time, the problem of the level curves of an analytic function serves 
as an excellent reason for studying the local method. This method can, of course, 
be applied to more complicated problems, such as investigating the level curves 
of a ratio of analytic functions f(X)/g(X) near a point X = X° at which 
f(X°) = g(X°) = 0. In Chapter 2 we will use the local method to investigate 
the integral curves of ordinary differential equations. 


3.10. On Normal Forms 


The method presented above includes the following. For a given open polygon 
P(f), we divide a neighborhood & of the degenerate critical point X = 0 into 
several sets (ce), corresponding to the vertices and edges of ff). Then, in each 
of the sets %}(e), we make a transformation to Y coordinates so that 


f=ay2. (49) 


We then divide each of the sets US"(e:) into a finite number of subsets Wile), 
none of which contains more than one root of the truncation fj”. If a subset 
W;,(e) does not contain a multiple root of that truncation, then in it we introduce 
its own Y coordinates for which (49) is satisfied. Otherwise, we reduce the 
singularity by constructing a new Newton’s polygon and further divide W;,(e) 
into smaller subsets, just as we did originally for %. After a finite number of steps, 
this procedure will yield a finite division of the neighborhood W into subsets 


Wy,...(é), in each of which the coordinates have been transformed so as to put 
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f into the simple form of (49). Here, the exponent Q and the coefficient a will 
be different in different subsets of @. Expression (49) is , naturally, called the 
“normal form” of fin the subregion Wj, ___,(e). 
If the truncation fr has no multiple roots, we can simplify the function f in 
the entire set %{"(e) at once: 


Theorem 6. If the truncation SONX ) has no multiple roots away from the 
coordinate axes, then in the set Ue) there exists an invertible coordinate 
transformation 

x, =y(l+h(Y)), (50) 


which transforms the function f to the form f = FE Y). 


We do not present the proof. We do note that the 4; in transformation (50) 
are series in integral powers of Y, the supports of which lie in the half plane 
V = {Q: (Q, P;) <0}, where P; € U}”. Here, Vis a tangent cone. 

Thus, in the set %*(¢), the function f is equivalent to its truncation f,"”. 
We can therefore say that, in the absence of multiple roots, the truncation is 
the normal form of the function in the set #(e). 

Finally, we can limit ourselves to coordinate changes which are invertible and 
analytic over the entire neighborhood %, and seek the simplest form (the “normal 
form”) to which f can be transformed using such a change of coordinates. It 
would be appropriate to enumerate at this point a great number of different 
cases, but there is no general answer (see Arnold, 1975). This is because this class 
of transformations is extremely limited, and cannot correspond to the many 
possible complexities in f. 

Thus, the form taken by the “normal form” depends on the class of coordinate 
transformations used. Here, we consider only invertible analytic changes of 
coordinates which differ depending on the size of the region of definition. The 
smaller the region, the simpler the normal form. 


3.11. On Partitioning Spaces 


We now enumerate the objects in various spaces which we have associated 

with the series 
f= Y foxe (51) 
eeDU) 

1) In the R? space with coordinates q,, q2, to the series (51) there correspond 
the set D(f)} (the support) and its convex hull—Newton’s polygon [(f), Newton’s 
open polygon f, the edges, J;, and vertices, J;, of F, and the sets D. 

2) In the dual space R3 with coordinates p,, p2, we have associated series 
(51) with a partitioning of the third quadrant into cones U™, corresponding to 
the edges and vertices of /, as well as the sets U(e) (for some sufficiently small 
é > 0). 
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3) In the R2 space (with variables x, and x,), we have associated the series 
(51) with a partitioning of a neighborhood of the point X = 0 into sets Y(e). 
There is a One-to-one correspondence between these sets and the sets UM(e). 
Each of the sets %*(e) may consist of a number of component pieces which 
intersect only at X = 0. The union of all these sets @(e) is the entire neighbor- 
hood of X = 0. In each of the sets Y{(e), the leading terms of the series (51) 
(as X — 0) are given by a corresponding truncated series, ff. By this we mean 
the following: Let {x®, n= 1,2,3,...} be a sequence of points of &(e) which 
approach the origin as n— oo. Then on the sequence {X)}, as n— oo, all 
terms f)X2 which are contained in the truncation f, have the same order 
of magnitude, while the other terms of the series (51) have higher orders of 
magnitude. 

Our entire discussion has been for real variables X. However, all we have said 
applies equally well to complex-valued x, and x2. The sets #{(e) will be defined 
by the same formulas, but will be parts of the two-dimensional complex space 
C2. The properties described above for truncated series will remain unchanged 
by a move into complex sets %{*(e). Therefore all our theorems and power- 
transformation methods (with real matrices a, of course) apply to the case of 
complex X. 

With respect to power transformations, the values x; = 0 and x; = 00 are 
equivalent, so the theory we have described can be used to investigate a “neigh- 
borhood of infinity”. It is then convenient to assume that the space with variables 
x, and x, is the Cartesian product of a space {x,} with a space {x,}, and that 
each of the spaces {x,} is either a complex sphere (the complex case) with the 
single point x; = oo, or the real part of that sphere also with the single point 
x; = 00 (the real case). Such a space, {x,} x {x,}, contains points of the form 
X, = a@,X, = 00 or X, = 0, X, = 00; in the real case, the space is homeomorphic 
to the two-dimensional torus. The coordinates of the space R3 are p; = In|x;|. 
It is therefore convenient to consider the space R3 as a Cartesian product of two 
real lines —co < p; < 00, so that it contains points of the form p, = —00, p. =a 
and p,; = —©, p, = +00, and is homeomorphic to a square. 

Those level curves of the polynomial f(x,,x 2) for which x, > 00, x, > 00, or 
X,—+ 00, x, 0, can be determined in the same way as those for which x, > 0, 
X20. The Newton polygon I is constructed, and then that part of its boundary 
is selected where the normal cone uy intersects the cone K of the problem. Thus, 
if x? = 0 = x9, then K = {P: p, > 0,p, > 0}; if x? = 00, x} = const # 0, then 
K = {P: p, > 0,p, = 0}; if x? = 00, x} = 0, then K = {P: p, > 0,p, < 0}, and 
SO on. 


Example 6. We shall find the location in the x,, x, plane of the roots of 
the equation 


ff = xdx} + (2x, — 6x2)x3 + (x? — 12x, + 1)x3 + (2x, — 2)x, +1 


= x?(x¥ — 6x3 + x3) + x, (2x3 + 12x} + 2x.) +x3-—2x,4+1=0, (52) 
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and their asymptotic forms for which x, +0 or oo as expansions of x, in 
powers of x,. The set D (consisting of 9 points) and the Newton polygon 
(a parallelogram in this case) are shown in figure 44a. The boundary @/ contains 
the four edges 1. FY, FE, FE (fig. 44a). In this case 


UW? = {P:p: <0,p2=0}, US = {P: —p, =p, > 0} , 
UW) = {P:p, >0,p2=0}, UP = {P: —p, =p, <0} . 


Hence, the edge I") gives a truncation for x, > 0, x, > const # 0; I”) gives a 
truncation for x, +0, x,— 00; If” gives a truncation for x, 0, x,—> 
const # 0; Ff") gives a truncation for x, + 00, x, > 0. We shall successively treat 


each of these cases. ; 
I) fi) = x} — 2x, +1 =(x,- 1). 


Here x, = 1 is a double root. If we make the substitution x, = 1 + y,, then 
Sf = y} — 8x, — 16x, y2 + O(x1 3) + O(xf) . 


The Newton open polygon F (the construction of which is left to the reader) 
consists of a single edge. That is, the unique non-trivial truncation for x, > 0, 
y, 70 is f = y3 — 8x,. Its roots are y, = +,/8x,. After the substitution 


y2= +,/8x; + Zz, we obtain 
f = £22,,/8x, F 16x,./8x, + O(z3, x3) . 


The terms written out above correspond to the unique non-trivial first approxi- 
mation for z, = O/. X,), as is evident from the corresponding Newton open 
polygon. Equating the truncation to zero and solving, we obtain the root 


Zy = 8x, + O(x14/x1) ; 
Thus, for x; — 0, x. + const there are two roots, 
xX,=1+ J 8x, + 8x, + O(x,./*1) , 
both of which exist only for x, > 0. 
2) ff) = x9 + 2xyx} + x}x$ = xG(1 t+ xx) 


Here there are two double roots, x. = 0 and x,x, = —1. The first of these does 
not correspond to the cone of truncation (for which x, — oo) and is rejected; the 
second root x, = —1/x, corresponds to the cone of truncation. After making 
the power transformation 


Max, V2 = X%1X%2 5 (53) 
we obtain 
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f = x17 y} + (2x77 — Oxy")y} + (1 — 12x71 + x77)y3 + (2 — 2x7")y2 + 1 
= x77 fol%1,Y2) » (54) 
fo = ya + yz + y3 — (6y3 + 12yz + 2y2)x1 + (y2 + 2y2 + Dxt 
To the truncation fp there now corresponds the truncation 
fo = yt + 2y} + ¥3 = yi(y2 + 1) 


Following the substitution y, = —1+ 22, we obtain f9(x,,y2) = 9(%1,22), 
where 
g = 23 — 223 + 2 — 4x, + 4x12, + 6x,23 — 6x23 + 23x? . (55) 


The roots of the equation g = 0 must be found in the neighborhood of the point 
xX, = Z, = 0. The Newton open polygon for (55) consists of a single edge, i.e., 
there is a single non-trivial truncation § = z3 — 4x,; its roots are z, = +2./x;,. 
We make the substitution z, = +2,/x, + w, and obtain 


g = wy t 4./x, Ww, + 40xt + O(w3, 4/1 WE, x1 Wa, X74/%1) - 


We are interested in the roots of the function g for which |w,| = o(./x4). From 
the corresponding Newton open polygon for these roots we obtain the unique 
truncated equation = 

+4,/x,w, + 40x? =0. 


That is, w. = + 10x, ./x, + O(x?). Thus, for x, +0, x, > 0 we obtain two 
roots, = 
X_ = X,'(—1 + 2./x, F 10x,./x, + O(x})) , 


both of which are defined only for x, > 0. 
3) ALY = x? x3 (x3 — 6x, + 1). 


In this case we are only interested in roots for which x, > 00, x, > const # 0, and 
these are the roots of the equation x} — 6x, + 1 = 0. The roots, x, = 3 + 2,/2 
are simple and positive. After the substitution x, = 3+ 2,/2 + y2, we obtain 


f= £4 /2x2y, + 12 + O(x,y3, x1 92, V2) - (56) 


The corresponding Newton polygon is shown in fig. 44b. We are interested only 
in the roots of the function in (56) for which x, > 00 and y, > 0, ie., the cone 
of the problem is K = {P: p, > 0,p, < 0}. Of the four edges of the Newton 
polygon shown in fig. 44b, only the normal cone of edge J{") enters the cone K. 
To this edge corresponds the truncation 


f= +4,/2x?y, +12, 
that is, 
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no tall +o)|. 


Thus, for x, > oc, x, > const. we have the two roots, 


3 1 
X2=34+2/2F O(—}. 
x2 = 3+ Vee (3) 


1 


Both roots are defined for all values of x,, other than 0. In this case the larger 
root does not exceed 3 + 2/2 while the smaller is no less than 3 — 25/2; 


4) f2) =(xyx. +1) . 


. After the power transformation (53) we obtain the polynomial (54), where 
fi!) = (y2 + 1)?. We are interested in the roots for which x, > 00, y, > const. 
Let y, = —1 + 2,; then according to (54) and (55) we obtain 


f =(z3 — 223 + 2z$)x{? + (—4 + 4z, + 623 — 623)xj! + 23. (57) 


c d 


Fig. 44a, b,c,d 
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The Newton polygon is shown in fig. 44c. Only one of the four edges has its 
normal cone within the cone of the problem, K = {P: p, > 0,p, <0}. The 
corresponding truncation of the polynomial (57) is —4xj! + z3; that is, z, = 
+2/./x1 + O(1/x,). Thus, for x, — 00, x, + 0 we obtain two roots 


1 2 1 
x2=- + + o( *) : 
Xt Xy4/Xy x4 


Both are defined only for x, > 0. 
It may be shown that the curve (52) has no critical points; consequently, its 
branches do not intersect. The branches in the x,, x, plane are shown in fig. 44d 


(the dashed lines are the asymptotes, x. = 3+ 2/2) 


Exercise 4. Find the asymptotes and the disposition on the entire plane of 
the roots of the polynomials in the examples and exercises of § 2. 


Exercise 5. Find the level curves and their graphs of the polynomials in the 
examples and exercises of § 2. 


Remark. The local method in conjunction with a computer may be used to 
study the singularities and asymptotic properties of the level curves of quite 
complicated functions which are not polynomials (see Bruno, 1978b). 


Chapter IT 
A System of Two Differential Equations 


§1. Simple Points and Elementary Singularities 


1.1. Introduction 


In this chapter, we will consider the solutions of a system of ordinary differential 
equations 
dx, /dt =X, = 9,(%1,X2) , 
(1) 


Ax, /dt = X= @2(X4,X2) 


in some neighborhood of a point x, = x?, x, = x3. We will assume that the 
functions @; are analytic in this neighborhood. Our problem is to find the 
solutions of system (1) in the neighborhood of (x?, x9) in the form (x, (z), x2(t)), 
i.e., as functions of an arbitrary parameter t. 

The character of the t dependence of the variables x, and x, will not interest 
us. Therefore it would be more exact to say that we seek the solutions (integral 
curves) of a single equation 


dx, = 2(X1,X2) (2) 
AX, 9, (%1,X2) 


But since notation (1) is the more convenient of the two, we will use it. 

If a solution x, = 5,(t), x2 = b(t) of equation (2) is known, then the first 
equation of system (1) will tell us that 

= (eee db, (t) 
1 (0; (2), Ba(2)) 

that is, for a known integral curve, the time dependence of the coordinates can 
be found with one quadrature. 

We make the parallel translation 


Xp=exe + xe, X2=xS+ xt. 


Then system (1) becomes 
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xf = oF (xT, x3) = Ax? aE xt, x8 + x3) > i= 1,2, 


which we will investigate in a neighborhood of the origin, x¥ = x¥ = 0. From 
now on we will assume that this translation has been made, so that our region 
of interest is a neighborhood of x, = x, = 0. 

If |g,| +|@2| #0 at the origin, then it is a simple point of system (1). If 
1, = @ = Oat the origin, then it is a singular point of the system; it is a stationary 
solution of system (1). 

At such a singular point, the expansion of the function @; begins with terms 
linear in x, and x,. If we isolate these terms, we can write system (1) in vector form 


X=AX 4°", (2’) 


where A is a square matrix and X is the vector (x,,x,). Let 2, and A, be the 
eigenvalues of A (i.e., the roots of the equation det(A — AE) = 0). If|4,|+|A,| 40, 
then the origin is an elementary singular point of the system; if 2, = A, = 0, then 
the origin is a non-elementary singular point. The classification of points for 
system (1) is similar to that for an analytic function f. The analogy is especially 
notable in cases in which system (1) has an integral f = const. Unfortunately, 
different terminology has arisen for the two different situations; we therefore 
present here the correspondences: 

1) simple points of f correspond to simple points of system (1); 

2) critical points of f correspond to singular points of system (1); and 

3) non-degenerate and degenerate critical points of f correspond, respectively, 
to elementary and non-elementary singular points of system (1). 

There are two points of view in the investigation of the integral curves of 
system (1) in the neighborhood of a singular point: 

1. Seek only those solutions which pass through the singular point. For this, 
it is sufficient to use the methods employed in § 2 of Chapter I to solve the analytic 
equation f(x,,x,) = 0. Works adopting this point of view include those of 
Briot and Bouquet [1856], Horn [1894], Bendixson [1898], Dulac [1904], 
and others (see Dulac’s review [Dulac, 1934] and §4 of Bieberbach’s book 
[Bieberbach, 1953]). 

2. Seek all integral curves in a neighborhood of the singular point, where 
the neighborhood may be arbitrarily smail. For this approach, one can use the 
methods employed in §3 of Chapter I to describe the level curves of an analytic 
function. This is the point of view adopted in this work. 


1.2. The Neighborhood of a Simple Point 
In what follows, we shall be making coordinate transformations 


x= &i(¥1,¥2) ? €(0,0) = 0 > i= 1,2 ’ (3) 


in the neighborhood of the origin; these changes of coordinates map the origin 
onto itself and will be assumed to be invertible (i.e., the Jacobian does not vanish 
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at the origin). The properties of such changes of coordinates were discussed in 
section 3.2 of Chapter I. 


Theorem 1. Let the origin be a simple point of system (1). There exists an 
invertible, analytic change of coordinates (3) under which system (1) becomes 


We=l, y2=0. (4) 


This theorem means that the solutions of system (L) are topologically equiva- 
lent to those of system (4) in some sufficiently small neighborhood of a simple 
point; see figures 45 and 46. We might call this the Cauchy-Arnol’d theorem. 
Cauchy proved the existence, uniqueness, and analyticity of a solution which 
passes through a simple point, while Arnol’d [1971b] considered the entire 
neighborhood of the origin, and formulated and proved Theorem 1. We note 
that system (4) is very simple; theorem 1! says that the solutions of system (1) in 
the neighborhood of a simple point have simple structure. 


Fig. 45 


29) 


be 
ov x 


Fig. 46 
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Proof of Theorem 1. Let us differentiate equation (3) with respect to t: 


oo OR ss 0g; 5 
x= . 
dy, Oya 


ne 


and substitute the derivatives x; and y; in their expressions in systems (1) and (4). 
Then we obtain a system of partial differential equations 


06; : 

pe p(X) = o(E.(Y),62(Y)), $= 1,2. (5) 

“J 1 
The functions €; must satisfy this system if coordinate change (3) does indeed 
transform system (1) into system (4). We will show that system (5) has solutions 
in the form of power series 


é; = ye Eig ¥2 ’ 
where Q = (41,42), Y = (v1, 2), Y° = yf y¥. Similarly, we can write 
Qi = > Gis X* : 


Since 


XS = ne? =(¥ Sip YP)(Y Soe V*)® , 
then the coefficient of Y2 in this product will be a sum of terms of the form 


Cip,Sip.-++ Sap, S2r,S2R,+++S2R,, > 
where 
Prt Py +o +P, +R, +R, 4° +R, =Q. (6) 


Therefore the coefficient of Y2 in g,(&,,€,) is a sum of terms of the form 


PisSip, +++ Sip, S2R,-+-$2R,, ’ (7) 


where equation (6) again holds. We define the norm of a vector Q = (q;,q2) as 
{Qll = 4, + q2. Note that €;, #0 only for P > 0 and ||P|| > 1. As a result, all 
the P, and R, in product (7) will satisfy ||P;| < |Q|| and ||R;| < ||Q}] if ||S|| > 1 
(i.e., if there is more than one term on the left-hand side of equation (6)). If 
S|] = 1, then ||P] = [|Q|| or ||R;l| = [Ql]. But it is always true that 


IPI <l@i, [Rll < fell . (8) 


That is, the formation of the coefficients of the terms in the series g;(€,, ¢2) whose 
index norms are ||Q|| involves only those coefficients €,p and €2, of terms in 
the series €,(Y) and €,(Y) for which ||P|| and ||R|] do not exceed ||Q||. 

On the other hand, the coefficient of Y? in the series 6¢,/Ay, is(qy + USig,+1,¢3 
Equations (5) are only satisfied when the coefficients of Y@ in the series 0&,/0y, 
and q, are equal for all Q, i.e., when 
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(41 + Wig, 41,92) = >; Gisé1P, ---ip, Oar, ++ G2R,, ’ i=1,2. (9) 


The norm of the vector index of the coefficient €i4,+1,4,) iS |lQ|| + 1; inequalities 
(8) are satisfied for all terms of form (7). Therefore, if all the coefficients ,, with 
|S|| < ||Q|j are known, then equation (9) will give us the coefficient Sig, +1,42) fOF 
q, +1 #40. If g, + 1 =0, the left-hand side of equation (9) vanishes, since it 
contains a zero factor. The right-hand side also vanishes, since the series 9,(&,, €,) 
do not include negative powers (¢, = —1) of the variables y, and y,. Thus, 
equation (9) sets no limits on the coefficients €,,,,,, which can be spcified 
arbitrarily. 

We will begin the determination of the coefficients ¢,9 with the linear terms 
Eu1.o)¥1 + €0,1)¥2> for which equations (5) yield 


£i1,0) = io.) » i=1,2. 


We will specify the coefficients £9 ,, arbitrarily, giving them values such that 
the Jacobian does not vanish: 


€1(1,0) €1(0,1) # (. (10) 


$2(1,0) $200, 1) 


We can do this because one of the coefficients ~9,9) must be non-zero, since 
the origin is a simple point of system (1). For example, if @i(o,9) # 0, and we 
choose €4(9,1) = 9 and €29,1, #0, then inequality (10) will be satisfied. This 
determines the coefficients €;9 with ||Q|| = 1. 

We next find the coefficients of the quadratic terms 


€ 42,04 + Ei yV1 V2 + F0,2)93 > i= 1, 2°. 
System (5) gives 
2Ei2,0)¥1 + Si.) ¥2 = Pir,oy6S1,0¥1 + S100, 1)¥2) 
+ @io,1)(S2¢1,0)¥1 + $200.12) - 


This uniquely defines the coefficients €,.. 9) and ¢,1,1), but the coefficients Ci, 2, 
can be arbitrarily specified (as zero, for example). Continuing in this manner, 
we can find the coefficients €,9 of terms of third and higher order. Ail the 
coefficients ¢;9,4,) may be specified arbitrarily (as zero, for example), and the rest 
will be uniquely determined. 

We can thus find the series €,(Y) which satisfy system (5); with the help of 
majorizing series (see Petrovskii, 1964) it can be shown that the resulting series 
é, converge for |y,|, |y2| <¢ but we will not do so here. The Jacobian of 
transformation (3) is the non-zero determinant (10), so that the transformation 
is invertible. The proof is complete. 0] 
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We have two comments on this proof. First, transformation (3) was found by 
the method of undetermined coefficients. These coefficients €;9 were somehow 
ordered (by the magnitude of qg, + q2, for instance) and were then found succes- 
sively according to this ordering. Second, some of the coefficients (here, the 
€40,q,)) Could be arbitrarily specified, while the rest were uniquely determined. 
We will frequently encounter a similar situation in what follows. 


Remark. In theorem 1, the coefficients and variables may be either real or 
complex. If the original system (1) is real (i.e., if all the coefficients gg are real 
numbers) then the coefficients ¢;9 will also be real (as long as the €,0,,,) are 
taken as real). 


1.3. Linear Systems 
We consider the system 
Xy = Gy, X, + 4,2X2 , 


Xq = 421X1 + 422%, 
or, in matrix form, 
X=AX, (11) 


where X is a column vector. After the change of coordinates 
X =BY, det B #0 (12) 
system (11) is transformed into 
Y=B'ABY=JY. (13) 


As is well known (see, for example, Pontryagin, 1961), every system (11) can be 
transformed, under a coordinate change (12), into a system (13) in which the 
matrix J = BAB is a Jordan matrix. That is, 


A, 0 
1=(3 A 


where 4, and A, are the eigenvalues of the matrix A; o = 0 or 1 if A, = A,, and 
o =Oif ds, # Ay. 

In order to find the matrix B which transforms the matrix A into the Jordan 
matrix J, we first find the eigenvalues of A by solving the characteristic equation 
det(A — AE) = Oor 


A? — (ay, + 422)A + detA=0. (14) 


Then for each of the eigenvalues 4; we find the associated eigenvector B, by 
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solving the system 
AB; = A,B; . (15) 


This system will determine only the ratio of the elements of the vector B, (one of 
the elements must be arbitrarily specified). Once both of the vectors B, are found, 
we let them be the columns of the matrix B (we write B = (B,, B,)). The numbers 
4,, A, and o are invariants of the system (11) under linear transformations (12). 


4 3 
Example 1. Let A = ( 1 ;) . The characteristic equation (14) is 
47—444+3=0, 


b 
whence 4, = 3, 4, = 1. For B, = (;: :) we have the system of equations 
21) 


4b,, + 3b,, = 3d, , 
—b,, = 3b2, . 


b 
Setting b,, = 1, we find b,, = —3. Similarly, for B, = (3) 


4b,. + 3b. = 52 ; 
—by, = ba 5 


whence b,, = 1, b., = —1. The matrix Bis 
B= bi, by2 _ -3 1 
ba, a2 i? 
—4 4 3\/-3 1\ (3 0 
—3/\-1 0 Se Oe 


In the event of multiple eigenvalues A, = /,, if the matrix A is not diagonal, 
we find the vector B, from system (15) (i.e. B, is the unique eigenvector); the vector 
B, is then found from the system 


Thus 


J =B"'AB -( 


IF 


AB, = A,B, + B, . (16) 


2 
—1 
whence J, = 4, = 1. For eigenvector B,, system (15) is 


1 
Example 2. Let A = ( ») . The characteristic equation is 4? — 2A + 1 = 0, 


25,2 + ba. = biz ’ 
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—by2 = ba, 5 
whence we can write b,, = 1, b,, = —1. System (16) then becomes 
2b, +b, =) +h2=),4+1, 


—by, = by, +b.2=b2,-1. 


1 1 
This yields b,, = 1, b,, = 0. Thus, B = ( = ) and 


s=wap=(o it allo G0 9): 


1.4. An Elementary Singular Point 


In this section, we shall discuss what happens when 9,(0,0) = 0 in system (1), 
ie., when the origin is a singular point of the system. Then, we can write the 
system (1) in form (2’). Using an invertible linear change of coordinates (12), we 
can transform the matrix A of the linear terms of system (1) into a normal form 
(a Jordan matrix), so that the system itself becomes 


Ji =Aii t GyYi-1 + PAY) , i=1,2, (17) 


where o, = 0, 6, = o, and the series gy contain no constant or linear terms. We 
consider here the case when at least one of the eigenvalues A; is non-zero 
(i.e., |A,| + [A,| 4 0); the point Y = 0 is then an elementary singular point of 
system (17). 

Our task is to transform system (17) into the simplest possible form using 
a local, invertible change of coordinates 


VY, = 2; + (21,22) , i=1,2, (18) 


where the series €; contain neither constant nor linear terms. If we wish trans- 
formation (18) to be analytic, then the &; will be power series which converge 
for sufficiently small |z,| or |z,|. Our original problem then divides into two: 

1) Find the power series €; which transform system (17) into simplest form; 
i.e., find the coefficients ¢,9 of these series. 

2) Investigate the convergence of these series. 

In solving the first problem, we will have to deal with power series for which 
we do not know whether there exists a neighborhood of the point Z = 0 in 
which they converge, or whether they diverge in every neighborhood of the 
origin. The latter are formal power series. Arithmetic operations and differen- 
tiation are carried out on them just as on convergent series. Up to now we 
have met convergent series only, but differential equations yield divergent series 
as well. 


104 Chapter II. A System of Two Differential Equations 


Example 3. The equation 


x?dy/dx =y—x 
has the power series 


y= ¥ (k— Dix. 
K=1 


as its solution. This series diverges for any x # 0. At the same time, this is the 
Taylor series of the “true solution” 


which is non-analytic but infinitely differentiable at x = 0. 

We will thus solve the following problem. Suppose (17) is a formal system. 
The ¢; are formal power series without constant and linear terms, and the matrix 
of coefficients of linear terms is a Jordan matrix. We will ask, what is the simplest 
formal system 


2, = A,z, + O;2;-1 + W(Z) = Wi(Z) > i=1,2 (19) 


to which we can transform system (17) with the help of an invertible formal 
change of coordinates (18) (where the €; are power series with no constant terms, 
and the matrix of coefficients of linear terms is the identity matrix)? 

To order to formulate an answer, we introduce a new notation: 


Wi = 2;9(Z) = 2; > JigZ? > i=1,2, (20) 
QeN; 
where 
Q = (41,492) ’ Z = (21,22) > Z2 w= 2hzP * 


As long as the ; are series in non-negative integral powers of z, and z, and have 
no constant terms, the summations of notation (20) will be taken over the sets 


N, = {Q: integral gq, > —1,q, >0,q,; + q, > 0} , 
N, = {Q: integral gq, > 0,q. > —1,q; + q2 >0} . 


ll 


That is, N; is the set of integral points in the q,, q, plane which satisfy the 
inequalities g, > —1, gq; >0 (j #1), 4, +4. > 0. The boundaries of the sets 
satisfying these inequalities are shaded in figure 47. 


Exercise 1. Show how the linear terms of system (19) appear in notation (20); 
that is, express the /; and o; as gj. 


1.5. The Principal Theorem on the Normal Form 


We introduce notation similar to that of expression (20) for system (17) and 
transformation (18): 
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Ni 


Fig. 47 


9; = yif(Y) = y; Y figY? , i=1,2, 
QeEN; 
(21) 
é, = 2,h,(Z) = z; y higZ? 5 i=1,2. 
QeEN; 
Here, the 9,, €;, and w; are all formal series in non-negative integral powers of 
the appropriate variables with no constant and linear terms. 
Note that the matrix of the linear terms in transformation (18) is the identity, 
so that the change of coordinates is invertible; that is, the z; can be written as 
power series in y, and y,. 


Theorem 2. For every system (17), there exists a change of coordinates (18) 
which transforms it into a system (19) for which (in notation (20)) 


Gitqy,q2) = 9 > if GA, + q,A, #0 . 
That is, the only non-zero terms in system (19) will be terms 2;9;9Z® for which 
(Q,A>=0, (22) 


where A is the vector (4,, 2,). These terms are called resonant. A system for which 
all terms are resonant is a normal form. Thus, theorem 2 says that every formal 
system (17) can be put into normal form by a normalizing transformation. 


Proof. The proof of theorem 2 is given only for the case when o = 0. For the 
proof of the general case see §1 of Chapter III and Bruno [1964, 1971, 1972a]. 
With €; given in (21), we differentiate equation (18) with respect to t: 


~ oh,, , oh, , ! 
yy = 2,1 +h) +2( Se, +532) " i= 1,2 . 


Replacing the time derivatives with their expressions in terms of coordinates 


106 ; ~~ Chapter II. A System of Two Differential Equations 


from (17), (19), (20), and (21), we obtain 


Ait WA = (Aizi + 29)0 +h) +265 elas + 2191) 


Oh; 
+ Z;—(A222 + 2292) , i=1,2. 
dz, 


If we now express the y; in terms of z, and z, according to formula (18), we get 
the following system of partial differential equations: 


yz; + Aizh; + 21 + hi) f(z, + 21h1,22 + 22h2) 


oh, 
= A,2z; + Ayzjh; + 2:9; + 2:9;h; + 2, p, 18 + 2101) 


oh 
+ Z3— (Azz. + 2292), £=1,2’ « 
02, 


Collecting like terms and moving some of them from one side of the equation 
to the other gives us 


Oh; Oh; 
2:9(Z) + 25 Fy 12, + 215 5p, 222 


Oh; Oh; 
= —2,g;h, - "gy, 7191 _ 7p 282 (23) 
+21 + hp) f(z, + 21:44,22 + 22h2) , 1) 2%, 


By virtue of (18) and (21) we have 
Vifis ¥* = z(1 + hi)fisZ°(1 + hyd + hy)? 


a ZZ fis ¥ Nip, -.-Ayp,hor,--hor,Z" F 
where 
T=Pp t+ P+ Ry to +R,. 


Therefore, in the last term of the right-hand side of (23), the coefficient of z,Z2 
is a sum of terms of the form 


Siship,---hip,har,---Aar, » (24) 
where 
S+P,4+°:°+PR4R,4+:°:°+R,=0. (25) 


Equations (23) will be satisfied only if the coefficients of like powers of Z are 
equal on both sides of the equations. That is, the i equation of (23) for the 
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coefficients of z,Z2 gives the equations 


Gig + hig<Q, A> =e YX hieGir — Y hip(Pigir + P292R) 
P+R=Q P+R=Q 

(26) 

+ sum of terms of form (24) . 


Note here that hip, gjr, and fis are non-zero only when ||P|| > 1, |R|| > 1, 
[S|| > 1. But in (25) and (26), the vector Q is a sum of several vectors P, R and S; 
hence, each of these must satisfy 


IPI <{l@l, [Ri <all . 


Let us denote by cg the right-hand side of equation (26); cig depends only on 
those h;p and g,p whose vector indices have norms less than that of Q. We thus 
write equation (26) as 


Jig + hig<Q, A> = Cig » QeEN; , i=1,2. (27) 


We can now find the g;g and h,g one-by-one, in the order of the increase of ||Q|. 
Suppose we already know all the hjp and gj, for ||P|| < ||Q|| and ||RI| < |Qll; 
then the c,g are uniquely determined. We can solve equation (27) thus: 


if <Q, A> # 0, then gig = 0 and hig = cig/<Q, AD ; 
if <Q, A> = 0, then gig = cig and hig is arbitrary . 


That is, the coefficients g;g can be non-zero only for resonant vector indices Q, just 
as the theorem suggests. The assertion of the theorem is proved by mathematical 
induction on ||Qj|. The induction begins with the linear terms, for which the 
assertion is trivial. The proof is complete. 1] 


1.6. Classification and Integration of Normal Forms 
We will find all normal forms (19). See also Bruno [1971, 1972a, example 1 
of the introduction]. In section 1.4 of this chapter we introduced the sets 
N, = {Q: either the integers g,, q. > 0, or g, = —1 and the integer q, > 1} , 
N, = {Q: either the integers g,, q, > 0, or the integer gq, > 1 and q, = —1} . 


We now let N = N, UN). Thatis, the set N consists of those points of the integral 
lattice in the q,, q. plane which lie either in the first quadrant, in the second 
quadrant along the haif line gq, = —1, gq, > 1, or in the fourth quadrant along 
the half-line q, > 1, q. = —1 (figures 47 and 48). Here, A = (A,,A,); let A, #0, 
and consider equation (22). If we write 2 = 1,/A,, this equation is equivalent to 


Aq, +q,=90. (28) 
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We consider all possible cases: 
a) Ais acomplex number (ie., not real). Then for real gq, and q2, equation (28) 
has only the trivial solution gq, = q, = 0, and the normal form is 


Sy =Ay21 , Zy = Azz. (29) 


The integral of this system is 


2 Maz or zy = Cozi , (29') 
and solutions are given by z; = c,;exp4,t, i = 1, 2. 

When 4 is real, equation (28) defines a line L in the q,, g, plane which is 
orthogonal to the vector (A, 1) and thus also to A. The normal form is determined 
by those points of N through which this line passes. 

b) If 4 > 0, then L passes through the second and fourth quadrants (in 
figure 48, L = L(b)). In the fourth quadrant, the only points of N are those of 
the form q, = m > 0,q, = —1. Therefore L can pass through one of these points 
only when Am — 1 = 0, ie, 4 = m™!. If A = m7! < 1, equation (28) has a unique 
nontrivial solution Q@ =(m,—1)eN which does not lie in N,. Hence the 
corresponding term will appear only in g, = z,'!w,. Thus, the normal form is 


2, =A,2, , 22 = 22(A, + GJoim,—1)21'22-) or 
(30) 
Z,=412, , 2, = A222 + Gam,-1)21 - 
This system integrates to 
Zz, = bzy(inz,; +c), b =gam,-1)/A1 - (30’) 


If A = m > 1, the result is the same but with the variables exchanged. 
If A = 1 (ie, A, = A,), the non-trivial solutions of equation (28) in N are 
Q =(-1,l)EN, and Q = (1, —1)e€ N,. These vector exponents correspond to 
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jinear terms on the right-hand side of system (19); their matrix must be a Jordan 
matrix, so the normal form is a triangular system 


2,=4\z,, 4) = 4,2, +02, . (31) 
If o = 0, this is just system (29) with 4, = 4,; ifo # 0, the integral of the system is 
z, =Aj'z,(Inz, +c) . 


If A # mand A # m , where m is a natural number, then equation (28) has 
no non-trivial solutions Q € N, and the normal form is just system (29). 

c) If A =0 (ie, A, = 0, 2, #0), the line L is the g, axis and passes through 
all points Q = (k,0) € N, where integer k > 0 (figure 48, L = L(c)). Consequently, 
the normal form is 


Sy 

: eee 

2, = 2, sy Gi(e.0)21 = 219:(21) , 
K=1 


} Pi ee) «) 
2, =22(A,+ 2 F21k,0)71 J = 2292(2)) . 
=1 


a 


This system is conveniently written as 
(In'z,) = 9,(z;) > (In'z2) = g2(z1) . 


If g, = 0, then z, = const is the integral of this system. But if g, # 0, then the 
integral is 
= {2¥s (2i)der 
Z,= 
2191(2;) 


d) If A < 0, the line L passes through the first and third quadrants (figure 48, 
L = L(d)). There are no points of the set N in the third quadrant, of course, but 
every point of the integral lattice in the first quadrant is in N. If / is an irrational 
number, equation (28) has no nontrivial solutions and the normal form is 
just system (29). If 2 = —1 (ie, 4, = —2,), then LON contains every point 
Q = (k,k), where integer k > 0. The normal form is 


: ~ kok) — 
24> a(t a X ey, = 2191 (2122) , 
k= yh 


- 
; kek —a 
22 = 2) (2 Pe sranztzt) = 2292(Z1Z2) . 
k=1 
Under the transformation w, = 2,23, Ww. = 22, this becomes 


fe) 
= Wi ye (Gice.e oe Jack,w) Wt > 
=1 
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W2 = W, (4 + px dunt) : 


which is similar to system (32). 

When 4 = —r/s, where r and s are relatively prime positive integers, the 
situation is similar to that above. The line L intersects N at the points Q = (ks, kr) 
(again, k > 0 is an integer). The normal form is 


: { ~ ks tr | = 
z= Zia + Y GiusenZP27 | = 2191 » 
im ; (34) 


‘ 


/ 
oo 
ef ks kr ) — 
oie z,( 2, + > dous,anzi*2 ) = 2292 - 
\ k=1 7 


Since r and sare relatively prime, there exist integers u and v such that ru — sv = 1. 
The power transformation 


W, = 2125, Wy = 2423 (34’) 


transforms system (34) into the form 


ioe) 
= k 
W,=W, iy (89 sks, 4) + Gaus, ary) 1 , 
k=l 


& @ 
W2 EB + vd + Y (uGicus.ery + 29 2¢s, 4) | > 
k=1 


W2 


which is again similar to system (32). Thus, depending on /, and /,, the normal 
forms are systems (29), (30), (31), (32), and (34); the last reduces to (32). In all cases, 
the normal forms are integrable. 


1.7. Power Transformations of Differential Equations 
1 1 
meen nw= (Pe), 
Inz, Inw, 
then the power transformation 


W, = Zit zhi2 
1 1 2 ’ = Oia O12 
“= ? 


Woe 2971z 52 > 


If we write 


can be written conveniently in the matrix form: 


InW=alnZ. (35) 
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Likewise, the inverse transformation is 


InZ=a'inW. 
whence we obtain 


Z2 = exp<Q,In Z> = exp<Q,a'InW) 
= expia!*Q,InW> = W7 "2 = We , 
where a* is the transpose of a. Thus, the new vector exponent is Q’ = a '*Q. 


\ 
Taking the column vector Gg = (0% }, we can write system (19) in vector 
form: 929/ 


(InZ)= ¥ Ggz?. (36) 
QeD 
According to (35), 


(In W) = a(InZ) = Yo aGgZ2= Yo aGgw? , 
QeD QeD 


that is, system (36) is transformed under (35) into 


(nW)= ¥ Gowe, (37) 
ved’ 

where Q’ = a*"'Q and Gg, = aGg. Specifically, the set D of points Q in the q,, q2 
plane corresponding to system (36) undergoes a linear transformation with 
matrix «*~! when the power transformation (35) is applied (just as in the case of 
a function in section 2.6 of Chapter 1). Therefore, if the points of D lie entirely 
on some line L in the q,, q2 plane, then the power transformation can be chosen 
so that the image of this line is one of the coordinate axes (the q| axis, say). Then 
all points of the set D’ will have g = 0, and the right-hand side of equation (37) 
will have no w, dependence. This is, in fact, precisely how we integrated normal 
form (34). This method can also be used to obtain integrals of normal form (30). 


1.8. The Convergence of the Normalizing Transformation 


We now present without proofs (which are very complicated) results on the 
convergence of normalizing transformations in cases a)—d) of section 1.6. For 
proofs, see Bruno [1971, 1972a]. We assume here that the original system is 
analytic. 

In cases a) and b) the normalizing transformation always converges since 


KQ,4>|>ellQl, if GEN, <Q,A>#O. (38) 


In case c) the line L lies on the q, axis. There is an infinite number of points 
Q € N on that line, and the normal form (32) thus has an infinite number of terms. 
Property (38) does not hold here. The normalizing transformation converges if, 
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in normal form (32), 
gi(zi)=0, (39) 


that is, if all the coefficients g,q9) = 0. If g, # 0 in the normal form (32), then 
there exists an analytic system which transforms to this normal form under 
a divergent transformation (this follows from Theorem III of Bruno, 1971, 1972a). 
All invariants of the equation (2) with respect to analytic transformations of 
variables may be found in Martinet and Ramis [1982]. 


Example 4 (Continuation of example 3). The system 
X, =X}, X2=X2-X (40) 


is equivalent to the equation 


iam x (41) 
The transformation 
mans at Yk Dyt= ye + Ew) (42) 
transforms system (40) into 
W=Vi, Yre=y2- (43) 


This corresponds to the fact, that using the particular solution x, = €(y,) of 
the inhomogeneous equation (41), we have put it into homogeneous form. But in 
normal form (43), we have g, = y, # 0; that is, requirement (39) is not satisfied, 
and the normalizing transformation (42) diverges. 

In case d), for rational A = —r/s, the normal form takes form (34). If there 
exists a series 


alz1.22) = Y aeei'2¥' (44) 


such that 
gi = Aa ’ i= 1, 2 ’ (45) 


in normal form (34), then the normalizing transformation converges. If condition 
(45) is not met, then there exists an analytic system which is transformed to 
normal form (34) under a divergent normalizing transformation. Condition (45) 
means that all the vectors Gg in system (34) are proportional to the vector 4, 
and g,s + g,r = 0. Under power transformation (34’), system (34) becomes 


(Inw,) = sg, +192 =91 5 
; (46) 
(In w.) = ug, + 092 =9) » 


for which condition (45) implies that g, = 0. System (46) is analogous to system 
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Fig. 49 


(32), and has the same condition for convergence; it is invariant under power 
transformations. Geometrically, condition (45) just means that the vectors Gg 
are orthogonal to the line L. For analytical invariants see Martinet and Ramis 
[1983]. 

In case d) with irrational A, the normal form (29) is simple, but the conditions 
for the convergence of the normalizing transformation are not. In fact, the 
question of convergence cannot be completely answered in this case. Siegel 
[1952] proved convergence when all the integral vectors Q satisfy 


XQ, A>| > e(14i| + la2l)” . (47) 


where v is some positive number. Geometrically, this means that the integral 
points Q do not approach too close to the line L (figure 49). Condition (47) can 
be slightly relaxed in order to preserve convergence. On the other hand, if those 
integral points do approach too close to the line L, then there will be divergence. 
The exact boundary (with respect to A) between convergence and divergence 
has not yet been established; but as of this writing, sufficient conditions for 
convergence and sufficient conditions for divergence are known; they are very 
close to each other (see Bruno, 1971, 1972a). 


1.9. Real Systems 


Until now, we have been concerned with complex systems (1); that is, systems 
in which the coefficients in the series @,, as well as the coordinates x,, are complex- 
valued. Further, we have used complex changes of coordinates to simplify these 
systems. 

But now we wish to consider specifically real systems (1), i.e., those in which 
the coefficients of the series g;, and the coordinates x;, are real-valued. For such 
systems, it is natural to seek simplifications with real changes of coordinates. 

Thus, we consider the real system 


X = AX + @(X), (48) 
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where the series 9 contains no constant or linear terms, and the coefficients of the 
series © and the elements of the matrix A are all real numbers. The characteristic 
equation for A, det(A — AE) = 0, has real coefficients, so the eigenvalues 1, and 
A, of A (the roots of the characteristic equation) are either both real or both 
complex. If they are real, then the linear part of system (48) can be put in Jordan 
form by a real linear transformation. As we saw in the proof of theorem 2, the 
transformation to normal form will also have real coefficients if the resonant 
coefficients h;g are taken as real. Consequently, the normalizing transformation 
and the normal form will both be real. 
If the eigenvalues are complex, they must be complex conjugates 


A,=4,=pt+iv, v0. (49) 
In this case we call a system 
B= nf¥i,%2), jHul,2 (50) 


a real normal form if the system is an ordinary normal form in the complex- 
conjugate coordinates 

2, =0, +102, 22 = 0, — iv, (51) 

Theorem 3. A real system (48) with complex-conjugate eigenvalues (49) can be 


transformed into a real normal form (50) with the help of a real, formal coordinate 
change (3). 


Proof. By virtue of (49), 4, # 4,, so the Jordan form J of the matrix A is 
diagonal. Let B, be the eigenvector of A corresponding to the eigenvalue /,; then 
its complex conjugate vector, B,, will be the eigenvector corresponding to the 
other eigenvalue, A, = 1,. We write the matrix B as (B,, B,). We saw in section 1.3 
of this chapter that under the linear transformation 


X = BY (52) 


the matrix A becomes the Jordan matrix J = B~1AB. Real values of variables X 
then correspond to complex-conjugate values of variables Y: 


WSF + (53) 


For a series f(Y) = ¥ fo¥®, we will let f(Y) = ¥ foY2. 
Under transformation (52), system (48) becomes 


YHA + VA(M1,V2) » rS1,2-. (54) 


Since the variables y; are complex-conjugates, the second equation of (54) is 
conjugate to the first, and 


Arya + Vif Ya) = 422 + Vof2(V1,V2) - 
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Taking equation (53) into account, we obtain the correspondence f,(¥1,¥2) = 
J2(¥2-¥1), which yields the following relation between the coefficients in the series 


f, and fy: 


Surar.an) rn Srtexa)d : (55) 
Similarly, if the formal change of coordinates 
y= 24+ Zz: >, higZ? . i= 1,2 (56) 
satisfies = 
Aycay.a2) = A2@2.40 > (57) 


then the complex-conjugate coordinates y, and y, correspond to complex- 
conjugate variables z, and z, (i.e. z,; = Z2). We will show that there exists 
a normalizing transformation (56) with property (57), which transforms system 
(54) into the normal form 
2,= 4:2; + 2; GigZ® , i=1,2, (57') 
where 
9 1041.42) = 92@2.41) - (58) 


We will prove this by mathematical induction on the norm ||Q||. Let Q = (41,42), 
Q’ = (42,q4,), and let properties (57) and (58) be satisfied for all coefficients hjp 
and 9x for which ||P|| < ||Q|| and ||R|| < ||Q||. We will show that the equations 


hig= hog: > G10 = 929: (59) 


are satisfied. Recall that the coefficients in the series f, g, and h are related by 
equations (26). The right-hand sides of these equations, which we denoted by cig, 
will thus satisfy 

Cig=C29° - (60) 


This follows from property (55) and from the inductive hypotheses of properties 
(57) and (58). Equations (27) yield 


Gig +t h1g<Q,A> = C19, G2q' + hog: (Q', AD = C29 - 
In agreement with (49) 
<Q, A> = wd + 42) + (G1 — G2) » 
<Q’, A> = H(Qi + 92) + iV(G2 — 41) - 
That is, <Q, A> = <Q’, A). Therefore, if <Q, A> 4 0, then 


- c Ci9° 
a a eS a A 


and, thanks to (60), property (59) is fulfilled. If <Q, A> = 0, then g:9 = 929 = Cig, 
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the coefficient h,g can be arbitrarily chosen, and we take hjg. = hig. Thus, 
property (59) is satisfied for all Q, and system (57’) is a normal form. Transfor- 
mation (51) from Z to V coordinates then transforms the complex normal form 
(57’) into a real normal form (50). Finally, note that the successive application 
of transformations (52), (56) and (51) results in a real transformation from X to 
V coordinates. The proof is complete. 1] 


For the general case, the proof of the existence of a real normalizing trans- 
formation is given in section 1.9 of Chapter HI. 

For a discussion of the divergence of such transformations, see Bruno [1982] 
and Martinet and Ramis [1986]. 


1.10. On a Smooth Normalization 


For real coordinate changes, it is possible to use not only analytic trans- 
formations, but also smooth (i.e., continuously differentiable), or just continuous, 
transformations. Since there frequently exists no analytic normalizing transfor- 
mation, the following problems arise for real systems: 

1) In what cases does there exist an infinitely differentiable transformation 
to a real normal form? 

2) In what cases can the system be linearized by means of a continuously 
differentiable change of coordinates? 

3) More generally: what is the simplest system to which a given system can 
be transformed with the help of change of coordinates of a finite degree of 
smoothness. 

At present, we have the following answers to the above questions: 

1) This question was posed by Birkhoff [1929]. Sternberg [1958, 1959] and 
Chen [1963, 1965] both arrived at the result that, provided that 


Red, #0, Red, #0 (60’) 


there always exists an infinitely smooth normalizing transformation. Bruno 
[1973c] presents an hypothesis which, for two-dimensional systems, can be 
formulated thus: 


Hypothesis. There always exists an infinitely smooth normalizing transformation 
for a two-dimensional, real, analytic system. 


Tokarev [1977], refining a result of Bibikov [1971], proved the validity of 
this hypothesis in the case of pure imaginary 4, = 4, = —A,, when condition 
(44) is not satisfied. Tokarev [1979, 1984] and Belitskii [1986] proved the general 
case. 

2) Under condition (60'), every two-dimensional system is equivalent to its 
linear part in the class of continuously differentiable changes of variables (see 
Hartman, 1964, Ch. 9, 148). Samovol [1972] formulated the conditions for a 
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linearization for an infinitely smooth system in the n-dimensional case by means 
of a transformation of a finite order of smoothness. Belitskii [1973, 1975b, 1978, 
1979b] studies conditions for a linearization ofa finitely smooth system by means 
of finitely smooth transformations. If condition (60’) is not satisfied, it is easy to 
construct examples in which the integral curves of the normal form and of the 
linear system are not even topologically equivalent. 

3) There are no complete answers to this question, though there are some 
isolated results (see Samovol, 1979, 1982). 


1.11. Integrating Real Normal Forms 


In the same order as in section 1.6 of this chapter, we will analyze the behavior 
of the integral curves of normal forms of real systems (see also sec. 1.8). 
a) Ais acomplex number. Then Re A, = « # 0. The normal form is the system 


2,=412, , 2, =Ay2, . 
The real normal form, in the variables of (51), is 


By = pv, — vv, , by = voy + Uv, . 
Its solutions are 
v, = ce" cosvt , v,=ce"sinvt . (61) 


These integral curves are spirals, approaching the origin in the v,, v, plane 
(figure 50). This disposition of the integral curves is called a focus. So long as 
the normalizing transformation in this case converges, then the integral curves 
of the given system will be slightly distorted spirals. 

b) Ais a real, positive number. In this case, 4, and A, are also real. The normal 
form may be of type (29), or (if 2 = m™') type (30). For system (29) with 4, # A, 
the integral curves z, = cz{?/*" converge to the singular point, tangent to the z, 
axis if |A,| is the smaller of |A,| and |4,| (figure 51). Moreover, both axes are 
integral curves. If A, = 23, the integral curves are lines z, = cz, (figure 52). 

If 4 =m", and in the normal form (30) gom,-1) #0, then for m> 1 the 
disposition of the integral curves (30’) is similar to figure 51, except that the z, 


Fig. 50 
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22 


Fig. 51 


Fig. 52 


Fig. 53 


axis is not an integral curve (figure 53). Note that in this case the smooth 


transformation 
wy = 2, — b2zT In|z,| (62) 


transforms system (30) into (29); that is, these cases are equivalent in the class of 


smooth changes of variables. 
If A = 1 and o £0 in system (31), then the logarithm will be dominant in 
the integral curves 2, = oA;1z,(In|z,| + c) (figure 54). Transformation (62) is no 
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22 


Fig. 54 


Fig. 55 


longer smooth, but it remains continuous and takes system (31) into form (29). 
Since the normalizing transformation always converges when 4 > 0, all real 
systems in this case are equivalent to system (29) from a topological point of view. 
This case is called a node. 

c)A = 0. Here A, = 0, and A, is a real and non-zero number. The normal form 
has the form of system (32). If condition (39) is satisfied, the normalizing trans- 
formation converges, and the solutions of the normal form are z,; = c = const, 
Z, = expg,(c)t. The integral curves are vertical half-lines (figure 55), and the z, 
axis consists of fixed points. 

If g, £0, then g, = 91¢,0)21 + °°"; the integral of the system (32) is 


Z,)dz A 
In|zy| +e = | 2 dz, 2 (1 +-++)dz, 
2191(21) 9i(,0)71 
A 
= —__?_ _ 7+ "*)4 
19 ,¢,0)21 
since g, = 4, +--:. Here, the z, and z, axes are always integral curves. The 


behavior of the integral curves depends on the parity of | and the sign of the 
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2 


Fig. 57 


ratio A3/9,.1,0)- If | is odd, the integral curves behave exactly as in figure 56, 
up to a change of sign in the z, coordinate (a saddle-node). If | is even, then 
the behavior is as in figure 57 for A2/9,¢,0) > 0, or figure 58 for 42/g1<,9) < 9. 
In figures 56 and 57, the tangency of the integral curves to the z, axis is of infinite 
order. 

Although the normalizing transformation diverges in cases when g, $ 0, 
there always exists a real, infinitely differentiable normalizing transformation. 
Belitskii [1986] has proven this (see the hypothesis in § 1.10 above). 

d) A < 0. There are two principal sub-cases. 

First subcase: the numbers /, and /, are real. Then the integral curves of the 
normal form are hyperbolas (figure 59—a saddle point). As Sternberg [1958, 
1959] showed, there always exists a real, infinitely differentiable transformation 
into a normal form in this case (see also Birkhoff, 1929 and Chen, 1963). If A is 
irrational, the normal form is a linear system (29). For rational 4, the normal 
form (34) contains, generally speaking, infinite series. But with the help of a formal 
change of coordinates, it is possible to truncate the right-hand sides of system 
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I 
a 


Fig. 59 


(34) to a finite number of terms (see section 2.3 of Chapter III). In the class of 
continuously differentiable changes of coordinates, the normal form is always 
equivalent to a linear system (see the preceding section). 

The second subcase: the numbers 4, and 4, are complex, hence pure imagi- 
nary: A, = —A, = iv. Here A = —1, and the normal form is system (33). This is 
integrated by introducing the variable w, = z,22 = v? + v3, since 


Wr = W,(9,(W,) + g2(w1)) - 


Here the reality condition gj (9, .4,) = 92(q,.4,) implies that 914.4) = Ja,%» that is, 
that all coefficients of the series g,(w,) + g2(w,) = 2) Regia, nwt are real. The 
variable w, is also real, and will be the square of the length of the radius vector 
in the real plane v, = Rez,, v. = Imz,: w, = 2,2, = (Rez,)? + (Imz,)?. 

Ifg, + g, = 0—that is, if all the coefficients g,4,,,, are pure imaginary—then 
the system integrates to w, = const., and the integral curves in the v,, v2 plane 
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vy 


Fig. 60 


are the circles v? + v3 = const. (figure 60). In this case (called a center) the 
normalizing transformation always converges. 

If g, + 92 #0, then g, + g, = aw; + °*:, where the number a is real and 
non-zero, | 21, the coefficients g,q,,) are pure imaginary for k <J, and 
a= 2Regyq,1 #0. The integral of this system for real w, is 


foe. io ( gi(w,) dw, cai Reg, dw, _f Img, dw, 
*S wi(g1 + 92) ‘wig, + G2) Jwi(gi + 92)’ 

but 

R Ll I ImA 

eee oe ei cath 428), 

9+92 2 91+ 92 aw, 
Therefore, the above integral is 

1 iImA, 
Inz, Fg nwt Fah (l+-"'). 


That is, the integral curves are spirals: as w, — 0, they turn clockwise if Im /,/a > 0, 
counterclockwise if Im A,/a < 0. 

This case is called a non-structurally stable focus; its existence and character 
are determined by the nonlinear terms of the normal form. In this case, the 
normalizing transformation diverges, but there exists an infinitely smooth 
normalizing transformation (see Tokarev, 1977). 

Necessary and sufficient conditions for the existence of a center are the 
countable number of the conditions 


Dickey + 920% =O, k=1,2... 


Here, the gj,4) are polynomials in the coefficients of the given system (48). If the 
right-hand sides of the equations in that system are polynomials of degree m, 
then it is sufficient that a finite number of those conditions be met (only for 
k < n(m)). Thus, n(2) = 3 (see Sibirskii, 1976, Siegel, 1956, § 25). 
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Remark. Normalizing transformations may be used to integrate differential 
equations approximately. To do this for system (1), only terms up to some finite 
degree are reduced to a normal form; all remaining terms are discarded. The 
solution of such a “truncated” system will be an approximate solution of the 
entire system if the “truncated” normal form contains leading terms (for example, 
the term g;y),9)2; in the case c), or an analogous term in the case of a non- 
structurally stable focus). 


Exercise 2. For each of the systems given below, find the normal form, 
integrate it, and sketch its integral curves. 

)x=ax-—4yt+(xt+ yt p=x—(x—y)t,a=6, 5,4, 3. 

2)X = 2x + y + 7x? + 3y?, p= —x + 6x? + y?, 

For the systems below, find the normal form only to those terms necessary 
to determine the character of the integral curves; find the behavior of those 
curves. 

3) X = 3x + 4y + 17x? + xy, p = x — 23y?, 

4) X = 2x + y+ 2x? + y?, p= 6x + By + 3x? + xy. 

5)X = y+ 6x? + xy’, p = —4x — 2x?y F y?. 

6) For what values of 4 and v does solution (61) approach the origin as 
t— +00, and in what direction does the spiral turn in the v,, v, plane? 


§2. Generalized Normal Forms 


2.1. The Second Theorem on the Normal Form 


Up to this point, we have considered systems 
Xi = Q(X1,X2) , i=1,2, 


in which the ~; have been series in non-negative powers of the variables. This 
reduced to the form 


X= Ax, + x, V figX? = Axi t+ xifi s i=1,2, (1) 


where the vector exponents Q ran through a set N almost entirely contained in 
the first quadrant of the R? plane. For each vector Q, we form a vector coefficient 
Fo = (fig, 2g) and create a set D = D(F) = {Q: Fg # 0}. That is, D(F) is the set 
of vector exponents of system (1) (the support of F). For example, if the functions 
@; are divisible by x,, then the set D for system (1) lies entirely in the first 
quadrant. 

We now consider a more general situation, in which the set D(F) lies in 
some sector V of the R? plane, bounded by rays extending from the origin 
(figure 61; compare with section 2.5 of Chapter I and figure 28). Thus, let R* and 
R,, be two linearly independent vectors, and let P* and P, be vectors orthogonal 
to R* and R,, respectively, with 


<R*,P,> <0, CR,,P*> <0. (2) 
We define the sector V of the R? plane as the set of points Q such that 


Py><0, <Q,P*> <0. (3) 


The boundary of the sector V consists of the rays passing through the points R* 
and R,,. Because of condition (2), the sector V lies to one side of the line passing 
through the origin and R* (likewise R,,); that is, the angle of the sector V is less 
than z, and V is a convex cone. Therefore every point Q in V can be expressed 
in the form 
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Q=a,R*+a,R, , a, 20, a 20. (4) 


We introduce the vector T = — P* — P,. The equation <Q, T > = 0 defines 
a line through the origin in the g,, q2 plane; the sector V lies entirely on one side 
of this line, since for Q # 0 at least one of the scalar products — <Q, P*> and 
—<Q,P,,> must be non-zero by the linear independence of the vectors P* and 
P,. But if Q € V, condition (3) informs us that these products are non-negative. 
Hence, for 0 € V\0 we have 


We denote by ¥(X) the class of all series of the form 
f= Y fox? > (5) 


where the integral vector exponents Q lie in the sector V. Note that sums and 
products of series of class ¥(X) also belong to this class. If |X|*° +0 and 
|X|®* +0 then, by (4), X20, for all Q © V\O. We therefore call series (5) 
convergent if it converges absolutely for some ¢ > 0 on the set 


My (e) = {X:|X1* <2 |XF < e,|x1| < [x2] <2} - 
Remark. Suppose a series (5) belongs to the class ¥, has no constant term, 


and converges on the set Wy(«). If we calculate (ec) = max|f(X)| for X € Wy(e), 
then y(e) > 0 as ¢ > 0. 


Lemma 1. If in system (1) the f, are convergent series of class V(X), then every 
coordinate transformation 


x; = y,(1 +h) =yi+y¥ higY? > i=1,2, (6) 
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where the h; are convergent series of class V(Y), transforms system (1) into the 
system 
w= AY + VG = Ai + Vi dV GieY? , i=1,2, (7) 


where the g; are also convergent series of class V(Y). 


Proof. 1) If system (1), transformation (6), and system (7), are written in vector 
form, 


X =@(X), (8) 
X=Y+ RY), (9) 
Y= WY), (10) 


then differentiating equation (9) gives 


1] 


ae a=) . 
X=¥ 4+—Yaul E+ — |F, 
Y ( 7) 


Employing expressions (8), (10), and once again (9), we obtain 


d=) _ = 
(e+=)en-or+e) 
or 
dz\ 
wy=(e+S) PY +). (10°) 


2) Let us say that a vector W =(w,,w,) is of class ¥* if w.y;'e ¥%. For 
example, = is a (convergent) vector of class ¥*. 
Our problem ts to show that ¥ is a vector of class ¥*. First, we shall show 


that the vector ®(Y + 5(Y)) is of class ¥*. Returning to the notation of (1) and 
(6), we see that 


Q; = Xi F(X) = x; dV SisX* 
= y(1 + hj) Vfis( +h, yl + h,)?¥* ’ i=1,2. 


Note that for a convergent series h; € Y, the series (1 + h,)* will be a convergent 
series of class ¥ for any integer s;. In fact, ifs; > 0, then (1 + h,)* is a polynomial 
in h; and is, consequently, a convergent series of class YW together with h;. And 
if s; < 0, then for |h,| < 1 the expression (1 + h,)* can be expanded in a conver- 
gent series of positive integer powers of h,;—once again, a convergent series 
of class ¥. The inequalities |h;| < 1! will be satisfied for series h; which, for 
sufficiently small ¢, converge on the set 


Uy(e) = {¥:|VI* <elVIP <ely,l<elyal <e} (11) 
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Therefore the product Y5(1 + h,)#(1 + h,)*? is a convergent series of class ¥, 
since S € V. So, too, is the series 


(1 + hy) Si fis YOU + hy (Ll + ha) , 


that is, the vector ®(Y + 2(Y)) is of class ¥*. By the remark preceding the 
lemma, {h;| < 1/2 for Ye Wy(e) when € < €9; hence 


[X[R” = |x, IT )xgh? = [YR UL + Ait + halt 


< YP) — Sy 4 ay < 28 (ayt = te 
Similarly, 
[X|Re < e(3)*27 = cye . 


That is, X ¢ Wy(e,), where e, = emax[c*,c,]. Thus, X € Wy(e,) for Ye Wy(e). 
So long as the vector (X) converges for X € Wy(e,), the vector ®(Y + 5(Y)) 
also converges for Ye Wy(e) for sufficiently small ¢. Therefore ®(Y + =(Y)) is 
a convergent vector of class ¥*. 

3) A matrix A = (aj) is said to be of class ¥** if ayy;'y,;e VW. Thus, the 
matrix 


O81 Oa 
az | OV: = Oy2 
oY | ag, ab, 

\ey1 Oy, 


belongs to class #**. Further, we have 


gre ae 
(« a 1 Oy2 6y2 re 
+35) =- ‘ : 
a el ees 
ey, oy, 


where 


4= aei(e + =) = (1 + sh (1 + 2) ahi Chas 
oY Oy, yz) yz Oy 

and the matrix to the right of 1/4 belongs to class ¥**. Clearly, 4 is a convergent 
series of class ¥ with a constant term 1; ie, 4 = 1+ 4(Y). By the remark 
preceding the lemma, if Ye Wy(e) for sufficiently small e, then |4| < 1. Therefore 
47! can be expanded in a convergent series of positive integral powers of 4, so 
that 47' is a convergent series of class ¥, consequently matrix (11’) is a convergent 
matrix of class V7 **, 

4) Finally, we note that the product W = AZ of a vector Z of class Y* by 
a matrix A of class ¥** is again a vector of class ¥*. In fact, w, = Y.ajz, = 
y:; ¥\(ayyi* y;)(z)y7"), where the expressions in parentheses are class Y series. 
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Consequently, the product 
OF = 
(e + ) @®(Y + 2(Y)) 
€ 


will be a convergent vector of class ¥ *. But this is just Y(Y). The lemma is 
proved. [J 


We will now consider formal series of class ¥, which may even diverge on 
any set Wy(«). Since the arithmetic operations and differentiation are the same 
for formal series as for convergent series, we can conclude the following from 
Lemma !. 

If in system (1) and transformation (6), the f; and h; are formal series of class 
¥, then the g; in system (7) are likewise formal series of class ¥. We then face 
the problem of transforming the formal system (1) into the simplest possible 
system (7) with the help of a formal change of coordinates (6). The answer to this 
problem is given by 


Theorem 1. Every formal series (1) can be transformed by a formal variable 
change (6) into a system (7) in which gio = 0 if <Q, A> # 0. Here, f;, h; and g, are 
series of class ¥ in their respective variables. 


Proof. The proof of this theorem is completely analogous to the proof of the 
fundamental theorem on normal forms (theorem 2 in section 1.5 of this chapter). 
Just as we did there, we can obtain the following equation for the coefficients 
of y,Y?: 


Jig + hig<Q, A> Se Y hipGir — ‘¥ hip P, Gr» 
P+R=Q P+R=Q (12) 
+ asum of terms of form fish, p,..-hiphor,---Mar, > 
where 
S+ Ppt +P4R)+°°+R,=Q. 


Let us denote by cjg the right-hand side of the i-th equation (12), so that 
Jig + hig<Q, A> = Cig . 


Now we order the vector indices P, Q, R in the following manner: P precedes Q 
if <P, T> < <Q, T>, where T = — P* — P,,. Then for all fis, hip, and giz, we have 
0 < <S,T)>, 0 < ¢P,T>, and 0 < <R, T>. Therefore c;9 depends only on those 
hyp and g,x for which vector indices P and R precede the vector Q in the above 
ordering. Moreover, there are only a finite number of integral vectors Pe V 
which precede Q. Hence, we can define the coefficients hg and gj9 just as we did 
in the principal theorem. 0] 
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Remark. If (R,, A> = 0, then the transformation (6) of the theorem can be 
chosen so that hig = 0 for <Q, A> = 0. In this case gig = fig for <Q, A> = 0. 
That is, the coefficients of the normal form (7) are found with no trouble: we 
simply retain those terms figX? with exponents Q that lie on the corresponding 
edge of the sector V. The edge itself lies on the line L defined by the equation 


<Q, A> = 0. 


Proof of the Remark. If a point Q lies on a boundary of the sector V and 
Q = P + R, where Pand Re V, then P and R must likewise lie on that boundary. 
This is a property of convex cones. Therefore, if Q lies on an edge of V, then all 
the vectors S, P, P,, and R; in equation (12) must lie on that edge. That is, all the 
coefficients hip, h, p,, and h,,, vanish, by the induction hypothesis. Consequently, 
equation (12) takes the form 


Jio + hig<Q, AD = fig - 


whence gig = fig because (<Q, A> = 0; the hjg are arbitrarily set to zero. 0] 


2.2. Integrating the Generalized Normal Form 


We consider the case when system (1) is real. Then A is a real vector, and 
the equation 


<Q, A> = 0 (12’) 


defines a line L in the q,, q, plane. The normalizing transformation (6) and 
normal form (7) will likewise have real coefficients. If the series of the given system 
and the normalizing transformation (6) converge, then we must examine the 
normal form in some set (11). The construction of the set M%y(e) for the situation 
in which R* and R,, lie in the second and fourth quadrants, respectively, was 
discussed in § 3 of Chapter I. In what follows, we will assume that we are treating 
exactly that situation, and we will examine the integral curves of the normal form 
in the set Wy(e), which here consists of four curvilinear sectors. We will use the 
same classification of cases as earlier. 

Case a) when the sole real solution of (12’) is the origin, is not allowed here; 
we proceed with case b). 

b) The line L intersects the sector V only at the origin (fig. 61 L = L(d)). The 
normal form is the system 


yi =A; 5 i=1,2; (13) 


this integrates to y, = cy??/*!. We note here that 


* ey 
Dp r A r 
2 i. 2 — Pr 1x 


* 2 | . 
Py Tr Ay OP ix oy 
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Fig. 62 


Fig. 63 


That is, in the first quadrant of the y,, y. plane the integral curves pass between 
the boundary curves, y, = e!?y7""? and y, = e!2+y7"+/"24, of the set Wy(e), as 
y, 20 (fig. 62, where the boundary curves are shown as dashed lines). The 
normalizing transformation in this case converges, and the behavior of the 
integral curves of the given system in each curvilinear sector of the set Wy(e) is 
the same: the integral curves enter the sector through both boundaries. 

c) The line L lies along the boundary of the sector V (fig. 61, L = L(c)) and 
R,, is an integral vector with relatively prime components. The normal form is 
the system 


W=Y; et Gul Y**)* = yiglY*) , i=1,2. 


Let z = Y*+ and g(z) = r,,.91(z) + r2492(z). Then z = zg(z). 

Now let us consider different sub-cases. If g = 0, then the integral curves are 
z =const., that is, y. = cy,"'*"**; their behavior is pictured in figure 63. Here, 
the integral curves pass into the curvilinear sector of the set %y(e) crossing the 
upper boundary, then approach the origin tangent to the lower boundary; some 
of the curves stay within the sector, while others exit across the lower boundary. 
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0 YN 


Fig. 64 


Fig. 65 


Now let g # 0, g = az* + ---; then the solutions are 


dz A, "| 
= cern | AE = cern] A+), i=1,2,; 


But how do these curves behave as z > 0? If k is odd, then the curves will behave 
as in figure 64 in two of the four curvilinear sectors of the set Wy(e); in the other 
two sectors, they will be as pictured in figure 65. If k is even, then the integral 
curves will be similar in all four sectors: as in figure 64 if A,/a > 0, and as in 
figure 65 if A,/a <0. 

Thus, the behavior of the integral curves in the curvilinear sectors of the set 
%€,(e) is determined by the parity of the number k and the signs of the numbers 
a and /;. The formal normalizing transformation does not always converge 
here but, evidently, there always exists a smooth normalizing transformation. 
Therefore, the integral curves of the original system in the set Wy(e) will behave 
like those of the normal form. 
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Fig. 66 


d) The line L intersects the sector V at interior points. Here the normal 
form is system (13), or else takes a more complicated form. Again, the formal 
normalizing transformation may diverge, but there always exists an infinitely dif- 
ferentiable normalizing transformation (this follows from the results of Sternberg 
[1958, 1959] with the help of power transformations). The behavior of the 
integral curves in the curvilinear sectors of the set Wy(¢) is pictured in figure 66. 
The solutions of the original system will behave similarly. 

Thus, we have been able to determine the behavior of the integral curves of 
system (1) in the set Wy(¢) in all cases, without any calculation whatsoever; all is 
determined directly by the coefficients of that system. In cases b) and d), it is 
sufficient to know 4, and 4,, while in case c) it is also necessary to use those 
coefficients f;9 for which the vectors Q lie along the corresponding side of the 
sector V. 

Actual reduction to the normal form is only necessary if we seek more exact, 
analytic expressions for the solutions. 


2.3. The Third Theorem on the Normal Form 


Here we consider a situation which is in a sense intermediate between the 
situations of the principal theorem and the second theorem on normal forms. 
Let the right-hand sides of the equations in system (1) be series in integral, non- 
negative powers of x,, and let the set D(F) lie entirely within the sector 


V={0:Q0=4,R* + a,R,,0, 20,0, 20}, 


where the vectors R* = (r#,r#) and R, =(r,,, —1) are such that r¥ <0 < rj, 
Ty» > 0, and |r#/r¥| < r,,. That is, the coefficient f,9 in the series f,(X) vanishes 
unless the vector Q lies in the sector 


1V = {0:0 =a,R* 4+ 0,£,,a, 20,0, 20}. 
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Fig. 67 


We will denote the class of such power series f, by ,¥(X). The coefficient fog in 
the series f,(X) will vanish unless the vector Q lies either in the sector ,V or along 
the ray g. = —1, q; >1,,, (figure 67). We will denote the class of such series 
by ,¥ (X). The set in X space corresponding to the sector ,V is 


Ue, X) = {X: |X|" =e |x,| < 8} 


(see figure 33). We will call the series f; ¢ ,;W/(X) convergent if the series @; = X;f, 
converge absolutely on some set Wy(e). 

Note that the sector ,V is contained in the sector V, but that the set ,@(e) 
contains the set Wy(e). 


Lemma 2. If the series f, of system (1) are convergent series of class ;V(X), 
and the series h; in transformation (6) are convergent series of class ;¥'(Y), then in 


the system (7), obtained from system (1), the series g; are convergent series of class 
iV (Y) as well. 


Proof. We note that all series of class ,¥(X) or ,»W (X) also belong to the class 
¥ (X) corresponding to the sector V. Recalling lemma 1, we realize that the series 
9:(Y) must be convergent series of class ¥(Y). It is evident from formula (10’) 
that the series w,(Y) contain only non-negative powers of the sgcond coordinate, 
Y2, since the series y, and €, have this property. Thus, the series yy; converge 
absolutely on set (11) and contain y, only in non-negative powers. But the first 
inequality in expression (11) 

|Y|**<e (14) 


implies that lyil"*lyal"* <¢, or 


Ivo] Belly lo . (15) 
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This places a lower bound on |y,|. Let the series Wy = ) Wg Y2 contain y, only 
in non-negative powers, and let it converge absolutely at a point y, = y®, 
y2 = y$. Then the series converges absolutely at any point (y,, y2): |y:] = ly°l, 
|y2l < |y$|, since at such a point Y the series y is majorized by the convergent 
series )|¥gl|Y°|®. Consequently, if the series converges absolutely along the 
curves |y,| =e" ]y,|"'*, then it converges for all Y satisfying |y| < e7!|y,|"™. 
That is, the region of absolute convergence of the series ¥, and yw, is not limited 
by inequalities (14) and (15); it is limited only by the remaining inequalities in 
expression (11). Thus, the series W; belong to the class ,W(Y) and converge 
absolutely on the set ,W(e, Y). The lemma is proved. 0 


We proceed to consider formal series of class ;¥. 


Theorem 2. If the series f; in a formal system (1) are of class ,¥'(X), then there 
exists a formal change of coordinates (6), where the h; are series of class ;W(Y), 
which transforms system (1) into a system (7) in which the g; are series of class 
;V(Y) consisting only of resonant terms gig Y2 with <Q, A> = 0. 


The proof is analogous to the proof of the principal and second theorems on 
normal forms (see sections 1.5 and 2.1 of Chapter II), and is therefore left as 
an exercise for the reader. 

Let us briefly consider the integrals of these normal forms on the sets , W(e, Y) 
for various cases: 

b) The line L intersects the sector ,V only at the origin (figure 67, L = L(b)). 
The normal form has either the form of (13), or is ¥; = 2,y1, V2 = Azy2 + ayT. 
In either case, the integral curves enter the curvilinear sectors of the set ,W(e, Y) 
and, staying inside them, approach the origin (figure 68). 

c) The line L is the q, axis. Here, just as in the situation of the principal 
theorem (section 1.5), we obtain four different kinds of behavior (figures 55-58). 


Fig. 68 
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Fig. 69 


d) The line L intersects the sector ,V at interior points. In this case, the integral 
curves behave as pictured in figure 69. Here, only one integral curve in each 
curvilinear sector of ,W%(e, Y) approaches the origin (as in the case of figure 58). 


Remark. The principal theorem on normal forms is analogous to theorem 2 
in §3 of Chapter I, in the sense that each theorem gives a simplification of the 
original problem, using the linear terms. The second and third theorems on normal 
forms (sections 2.1 and 2.3, Chapter II) correspond to a vertex of the Newton 
open polygon. In that sense, they are analogous to theorem 5 of section 3.7 in 
Chapter I. 


Exercises. 1. In case c) of section 2.2 for odd k, determine the disposition of 
the curvilinear sectors (like those in figures 64 and 65) in the various quadrants 
of the y,, y2 plane, depending on the parity of the coordinates of the vector R, 
and the sign of the quotient 1, /a. 

2. In the situation of section 2.2, examine the case where the line L passes 
through R*. 

3. Prove theorem 2. 


§3. A Nonelementary Singular Point 


3.1. Statement of the Problem 


For a system 
X= @AX1,X2.), 9(0,0)=0, i=1,2, 


let the functions @, be analytic at the origin, and let the origin be a non-elementary 
singular point. Then we can write the system as 


X= xfp= xi ¥ fioX? (1) 


D = D(F) = {@: |figl + Ifol #9} - 


Clearly, D < N. We represent D as a set of points in the q,, g, plane, and we 
find the convex Newton open polygon / which forms the boundary of the 
convex hull of the set D for orders P < 0. We denote the extreme vertices of 
f by Q, and Q*, as earlier. We index the vertices J; from bottom to top 
(7° = Q,,..., 4 = Q*), and likewise index the edges. 

We present here a method of integrating such a system in a neighborhood of 
the origin. For each vertex J;, we create a corresponding set %{(e). We assign 
the space &")(e) between the sets %(e) and %{9) (z) to the edge Jj”. Thus, the 
curvilinear sectors of the sets %{(e) entirely fill the neighborhood of the origin. 
In each of the sets %{(s), the system can be integrated by reducing it to a normal 
form (in general, the generalized normal form). The investigation of the system 
in the set &@"(c) reduces with the help of a power transformation to studying a 
number of singular points of a second system. Each of these singular points is 
simpler than the original. Hence, after a finite number of steps the set %!(¢) will 
be divided into a finite number of subsets, % (e), in each of which the system can 
be integrated with the help of a normal form. It then remains to “sew together” 
the results of integrating in the various regions %?(e). Since we are interested 
only in the integral curves, in what follows we shall make various changes of the 
time variable. 
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We write system (1) in vector form: 
(In.X) =) FyX@ = F(X) . 


Let 5 be an edge or vertex of the open polygon F of this system. Let us call 
the function FOX) = ¥ Fy X2, where Qe J, the truncation of the vector- 
function F. Just as in the case of a scalar function f (see §2, Chapter I), this 
truncation has its own cone U!®. The only new element is the fact that the 
truncation F ¥ 0 as a vector, so that if just one component differs from zero, the 
other may vanish and the vector will still be said to be non-zero. For example, 
if f, = =X, +X and f, = x? + x3, then the truncation for any vector order P <0 
is F = (hy, 0). Here, the second component of the vector F is not the truncation 
fy. We will henceforth denote the components of F by f;. For every truncated 
function F there is a corresponding truncated system 


(In. X) = F(X) . (1’) 


3.2. The Case of a Vertex 


Let J;°) = © be some vertex of the Newton open polygon &* of system (1). 
Let 5°? and T;” be the edges which intersect at Jj, and let Be , and R; be unit 
vectors along those edges, with r, ,-1,12,; > 0. We choose R, = —Rj-1, R* = Rj. 
If @ is an extreme vertex of [, for example 0 = Q,, then we ‘anoose R, = (1,0); 
if 0 = Q, we choose R* = (0,1). We now divide system (1) by X2; that is, we 
introduce a new “time” t, such that dt, = X2dt. Then system (1) becomes the 
system 


dx,/dt, =x; Y figX2@, i=1,2. 2 
x;/dt Xi 2 Sie i (2) 


The exponents Q — O of this system are obtained from the exponents Q of system 
(1) by a parallel translation along the vector 0. Hence, the vertex J; of system 
(1) corresponds to the zero exponent in system (2), so that A = Fo. Al the vector 
exponents Q — O lie in the sector V which is bounded by half-lines along the 
vectors R, and R* (figure 61). Therefore, system (2) satisfies the conditions of 
the second theorem on the normal form (section 2.1 of Chapter II). According 
to that theorem, there exists on the set 


UH(6) = {X: [XB <e, |X| <e} (2') 
a coordinate transformation 
x= y+ EA), i=1,2, (3) 
under which system (2) is transformed into the normal form 


dy/dt,=y, Y gigY¥®, i=1,2. (4) 
<0,45=0 
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As we showed in § 2 of this chapter, this normal form is integrable. The solutions 
of the normal form (4) yield, through transformation (3), the solutions of system 
(2) in the set (2’). Since the integral curves of systems (1) and (2) coincide, it is 
possible by this method to find the integral curves (and solutions) of system (1) 
in the set (2’) with any degree of accuracy. If we are interested only in the behavior 
of the integral curves in a neighborhood of the origin, then it is sufficient to find 
the character of that behavior in the set (2’). As shown in § 2 of this chapter, it is 
unnecessary to do any calculations for this purpose—simply knowing the coeffi- 
cients of system (1) is sufficient. All of this is a general construction. For a system 
(1) with analytic right-hand sides, there are some specifics which simplify the 
analysis of vertices as well as the general picture. We will identify four types of 
vertices O. 


Type I. The vertex @ is one of the extreme vertices, Q, or Q*, and lies outside 
of the first quadrant. Then one of the coordinates of 0 is —1. Suppose that 
42 = —1; then the point O belongs to the set N. but not to the set N,. Therefore 
the coefficient /,g in system (1) is zero. The vertex O = Q, lies on the lower 
horizontal support line of the set D(F), so that R, = (1,0), and the corresponding 
set We) is 

U,(e) = {X: |X|" <e,|x,|<e} . - (5) 


This set has the same form as is pictured in figure 33. Since A = Fg = (0, fog), 
the solutions of the equation ¢Q, A> = 0in the sector V are (k, 0). Since <R,, A> = 
0, both our second theorem on the normal form (section 2.1) and the remarks 
about that theorem apply. According to the remark, the normal form (4) has the 
form 


dy,/dt, = y; 2 Giu.oy ¥" = 9; Y fia.o+O¥i ’ i=1,2. (6) 


But here f«,0)+g6 = 0 since the second coordinate of each vector index (k, 0) + @) 
is equal to — 1; that is, these vector indices do not lie in the set N,. Consequently, 
the first equation of system (6) is dy, /dt, = 0, so the lines y, = const. are integral 
curves of the normal form (6). Hence, in the set (5) corresponding to this first 
type of vertex, the integral curves will behave as they do in figure 33; that is, in 
the set %,(¢), there is no integral curve which passes through the origin. 


Type II. The vertex © is one of the extreme vertices Q, or Q*, and O > 0. 
Suppose that 0 = Q, > 0. Then the vertex 0 corresponds to a sector V in the 
41> 42 plane, bounded by rays collinear with the vectors R* = (r¥,r¥) and 
R, = (1,0), where r* < 0 < r¥. For each point Q € D(F), the difference Q — 0 
lies in V. The vector V corresponds to the set (5) in the x,, x, plane. The second 
theorem on the normal form applies to system (2); it implies that the x, axis is 
an integral curve. The behavior of the integral curves in the rest of set (5) is like 
that in figure 68, figure 69, or figure 55. All of the cases enumerated at the end 
of section 2.3 are possible here. 
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Type III. O is not an extreme vertex, but one of its coordinates is zero. Suppose 
that g, = 0. In our general construction, this vertex corresponds to a sector V 
and a set (2’). But, by the construction of section 2.3, we can create the sector 


WV = {0:0 =a, Rf + 4, E,3%,,4, 2 0} (7) 
and a set , 
We) = {X: |X| <e|xq| <e} . (7') 


corresponding to such a vertex. Here 


vo V, Uy (8) < Ue) . 


The construction of set (7’) is similar to that of set (5). Our third theorem on the 
normal form (section 2.3) applies to system (2) within set (7’). At the end of §2, 
we showed that in any curvilinear sector of the set (7’) there must be at least one 
integral curve which passes through the origin (in case (c), it may consist of fixed 
points). In the situation under consideration, the vertex O is F); to its right on 
the Newton open polygon are the edge I") and the vertex I = Q,. We note 
here that set (7’) includes both W{!"(e) and %((e), the sets corresponding to I!) 
and 7{. Without the third theorem on the normal form, we should have to study 
system (1) in three different sets @(e), W{(c), and We). The third theorem 
allows us to examine the system in set (7’), which includes all three of the sets 
named. That is, the integration of the system in the set (7’) corresponding to the 
vertex © includes integration in the set %(e) corresponding to the vertex 
Q, = 1. Therefore, it is possible to treat a vertex of the first type without 
creating a separate set %@{°'(c) and integrating the system there, as long as it is 
adjacent to a vertex of type ITI. 


Type IV. 6 > 0, and O is not an extreme vertex. For such a vertex, there is 
no way to simplify the application of the second theorem on the normal form. 
Here, the set %(e) consists of four curvilinear sectors, as in figure 31. In the first 
quadrant of the x,, x, plane, the sector corresponding to one vertex lies above 
the sector corresponding to a lower vertex (see § 3, Ch. I). As remarked in §2 of 
this chapter, the behavior of the integral curves of system (1) in the sectors of the 
set U(E) can be determined from the coefficients Fg with no further calculations 
necessary. 


3.3. The Case of an Edge 


Let J; be some edge of the Newton open polygon of system (1). The 
endpoints of this edge are the vertices Jj and 159), and the unit vector along 
the edge is R = R, = (J) — J;)/m. The sets @(e) and %}9) (e), corresponding 
to the vertices 5 and [5{), are restricted by the inequalities |X|* <«¢ and 
|X|~® < «. Hence, for a set corresponding to the edge J;', we are left with 


UME) = {Xie <|X|*® <4, [x4] <@ |x| <e} - (8) 
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Just as we did in §3 of Chapter I, we investigate the behavior of system (1) in set 
(8) by applying a power transformation 


yi = xix? > Y Ss 
x=( *) (9) 


Paes r ir 
Vz = Xx? 


where the matrix « has integral elements and det x = t. We note that if X > Oin 
the set #(e), then y, > const. # O and y, > 0. In fact, s.r, = 1 + 527,50 


—_ Si? Sols = r F2\S2 _ 
Vy? = XING? = XA (xX)? = Xp? 


Consequently, yj? 0 as x, 0, since y, -»> const 4 0,00; recall that r, > 0. 
Thus, part of the set @'(e) near the origin is transformed into a part of a 
neighborhood of the y, axis: 


MMe) ={¥ilyl<e< St a ae (10) 
Under this power transformation, system (1) becomes 


(nY)= Y FOY%=F(¥), (11) 
ved’ 
the open polygon / of system (1) becomes the polygon /” of system (11), and the 
edge J; becomes a vertical edge [;"”. The truncated system (1’), corresponding 
to the edge /;", is transformed into a truncated system 


(nyy= Yo FY’ =F(Y), (12) 
Q’ eDary” 
corresponding to the edge J} (fig. 16). Since Jj" is a vertical edge, its points 
Q have a common coordinate qi =r; ie., F’(Y) = yt Fy'(y2). We now divide 
system (11) by the maximum power yj such that we obtain a system 


dyfdt,=yyAO)=C(Y), i=1,2, (13) 


where the right-hand sides are expressed only in non-negative powers of y,. Here, 
dt, = y,°dt. 

Thus, we arrive at the following problem: Investigate the behavior of the 
integral curves of system (13) in set (8), recalling that the y, axis corresponds to 
the point x, = x, = 0. In order to solve this problem, we need to clarify the 
character of those integral curves which pass through the finite points of the y, 
axis but differ from y, = 0. It remains to be noted that after division by y%, 
truncation (12) of system (11) becomes the truncated system 


dy,fdt, = yi yfi(y2),  i=1,2 (14) 


of system (13). We can identify two types of such truncations (14). 
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Type I. The second component of the vector F’(Y) is Fi(Y) # 0. In this case, 
r = s, and the y, axis is an integral curve, along which ¢,(0, y.) = 0. The points 
of this axis, at which ¢,(0, y,) # 0, are simple. In accordance with theorem 1 (§ 1, 
Ch. ID, the integral curves in the neighborhood of such a simple point are lines 
parallel to the y, axis. If we consider the fact that the y, axis corresponds to a 
single point, x, = x, = 0, then it is clear that the corresponding integral curves 
of system (1) cannot approach the singular point. Now we are left with the 
singular points on the y, axis (yz # 0, y. # 00). 

Such singular points, say y, = 0, y. = y%, are obtained as roots y, = y$ of 
the equation f4(y,) = 0. If at such a point dfs./dy, #0 or f{o # 0, then it is 
an elementary singular point of system (13), and its neighborhood can be investi- 
gated by transforming the system into a normal form. If, however, df{,/dy, = 0 
and fo = = 0, then the point is a non-elementary singularity, and to determine 
its character one must proceed to construct a Newton open polygon, and so on. 
However, it is now a simpler singularity than the original. 


Type II. If #(Y) = 0, then fk # 0, and s=r-+ 1. In this case the y, axis is 
not an integral curve of (13). If f/o(y9) #0, then the point (0, y9) is a simple 
point of system (13); one integral curve, which is not tangent to the y, axis, passes 
through this point. In the x,, x, plane, the corresponding integral curve passes 
through the singular point x, = x, = 0. If f{o(y) = 0 and ¢,(0, y2) #0, then 
the point (0, y$) is a simple point of system (13), but the integral curve through 
the point is tangent to the y, axis there. Along this curve, y, can be expressed in 
integral powers of y, — y} (Cauchy’s Theorem), and we can investigate this 
solution. If f{o(y2) =0 and ¢,(0,y$) = 0, then (0, y$) is a singular point of 
system (13). If it is an elementary singularity, the system is integrable in a 
neighborhood of the point; if it is non-elementary, it is necessary to construct a 
Newton polygon, divide the neighborhood into sets %'(e), and so forth. 

Thus, for both types, we must find at the y, axis the real roots y$ of one of 
the algebraic equations: 


fio(y2) =0 fortypeI and 


fiol(v2) =0 fortype ll . 


There is a finite number of such roots: y$), ..., y¥. Each point y,; = 0, y, = y#? 
has its own neighborhood %;(«) in the y,, y. plane. The remainder of set (10) is 
divided into a finite number of regions %(e) which contain no singular points. 
The system is integrable in each of these regions using theorem 1 of § 1 in Chapter 
IL. In the neighborhood @%,(e), the system (13) is integrable either immediately 
or through further division into sets %;,(e). The division of set (10) into the parts 

Ue) and W%;(e) corresponds, in the x,, x2 plane, to dividing the set %(¢) into 
more refined sets W{(e) and We). Moreover, system (13) (and, consequently, 
(1)) is immediately integrable in the sets %"(c), while in the sets W(e) further 
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refinement may be necessary. This depends on the complexity of the singularity. 
Generally speaking, this process of successive fragmentation of the sets %@(e) and 
resolving singularities may require many steps, but it will always provide a 
solution in a finite number of steps. This can be proved for differential equations 
exactly as it was for analytic equations in section 2.9 of Chapter I, using our 
concept of the “height” of edges and singularities. 


Remark. The functions f/o(y.) and fio(y2) not only determine the distribu- 
tion of secondary singularities, but also, in the case of an elementary singularity 
of the first type, yield the eigenvalues 2, = f{9(y$) and 2, = df§q(y$)/dy2. Thus, 
in many (but not all) cases, these functions entirely determine the character of 
the behavior of the integral curves. But these functions f{o(y,) and fSo(y2) are 
obtained from the truncated system (12), which itself is derived from the truncated 
system (1’) with a coordinate change (9). Consequently, many properties of the 
solutions of system (1) can be discovered from the truncated system (1’). Similar 
investigations of an n-dimensional system have been made by Shestakov [1960, 
1961], Bruno [1965], and Beklemisheva [1972]. 

A second practical method which facilitates calculation is limited to cases 
when the edge /;‘” lies entirely outside the first quadrant of the q,, q, plane. We 
need not examine it here since a vertex O of this edge must lie on the boundary 
of the first quadrant, and is thus a vertex of type III; the set (7’) corresponding 
to that vertex entirely contains the set %{")(e) corresponding to the edge J"), so 
that the last set need not be treated separately. 


3.4. Synthesis 


Thus, a neighborhood of the non-elementary singularity X = 0 of system (1) 
is divided into a finite number of sets %;;,.__,(¢). In each such set, we introduce its 
own coordinates y,, yz in which the system is integrable. Returning from Y to 
X coordinates, we obtain parts of the solutions of system (1) in each set Wj, ,(€) 
separately. Once these parts of solutions are “sewn together,” we will have 
integrated the system in the entire neighborhood. But a new difficulty arises here. 
The transformations from Y to X coordinates employ infinite series, so that these 
transformations can, in reality, only be calculated approximately (i.e., to terms 
of some finite degree). As a result, the parts of the solutions X in each of the sets 
Ui... ,(e) will only be approximate. And even though these parts of the solutions 
can be obtained arbitrarily accurately, the complexity of the calculations in- 
creases with increasing accuracy. To sew together the parts of the solutions in 
neighboring parts of the sets W,, _ ,(e), it is necessary that the approximate 
solutions be sufficiently close to the actual solutions. This requires different 
amounts of calculation in different cases. 

Let us consider the problem of clarifying the topological portrait of the 
integral curves in the neighborhood of a non-elementary singular point. We 
identify two cases: 
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1) Among all the sets %;,.__,() there is at least one in which an integral curve 
approaches the singularity. In this case, it is sufficient to divide the neighborhood 
& into sets W,,__(c) and to integrate system (1) in each of those sets in the 
appropriate Y coordinates. Sewing together the neighboring parts of the solu- 
tions with topological accuracy presents no problem. 

2) Not one of the sets W;,,.__,(¢) contains an integral curve which approaches 
the origin inside that set. Instead, the integral curves pass from one set W;,___;(€) 
to another, and never stay in any one. In this case, the integral curves simply 
twist around the singularity, either as closed curves (in the case of a center) or 
as spirals (in the case of a focus). The problem of distinguishing between a center 
and a focus is, generally speaking, extremely complicated. We leave it for §4 of 
this chapter. 


3.5. Examples 1, 2, 3 
We now consider a few examples of the first case. 
Example 1. We consider the system 
X= X_ +x? —XyxX_ = xy (xp1xp +X, — x2), 
Xq = ax]? + 3x,x, — 5x3 = x,(ax?xz! + 3x, — 5x.) , 
where the parameter a is non-zero. Here, 
Q=(-L), @2=0,0, Q;=(01), Q,=(2,—1); 
Fg, = (1,0) , Fo, = (1,3) , Fo, =(—1,-5), Fy, = (0,4) . 


The Newton open polygon F consists of a single edge [) and the two vertices 
Tr = Q, and Ff = Q, (fig. 70). Both of these vertices are of type I: the 
disposition of the sets W%{°(c) and &%°(c) and the behavior of integral curves in 
them is roughly sketched in figure 71. The remaining set W{!(e) is restricted by 


q 
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Fig. 71 


Fig. 72 


the inequalities e < |X|® < «"!, where R = Q, — Q4 =(—3,2) is the unit vector 
along the edge J"). If we seek a vector S = (s,,5,) such that s,r, — yr, = 1, we 
can choose S = (2, — 1). We then perform the power transformation 
Yr =x", yp = x7°xd ; 
whose inverse is 
ieee ma ayo aye 
x1 =Viy2 > X2=YViy2 - 

The original system becomes 

(In'y,) = 2y, yo — y2yo + 3y3y3 —a 

Vi Yi¥2 — Viy2 Yiy2 Vis 
(In y2) = —3y,¥2 + 3yty2 — Tyiy2 + 2ay, ; 


the second component of the truncated system, ff = —3y,y2 + 2ay,, is not 
identically zero (figure 72). Consequently, it is of type I by the classification of 
section 3.3. Dividing by y,, we obtain the system 
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dy,/dt, = 2y;¥2 — vivo + 3yiy3 — ay, , 
dy,/dt, = —3y3} + 3y,y2 — Tyty2 + 2ay2 . 


Here, the y, axis is an integral curve; the only singular point on it is y, = 0, 
y$ = 2a/3, obtained from the equation 


fiol¥2) = —3y2 + 2a=0. 


The only integral curve passing through all remaining points y, = 0, y, # 
y8, 0, is the y, axis itself. To investigate the neighborhood of the singular point 
Y = (0, 2a/3), we make a parallel translation 


Yo =YO+ 22 =fat zy ’ 
and obtain 


dy, /dt, =4ay, + 2y,2z. — yi(4a + z2) + 3yiGat 2)" , 
dz,/dt, = —2az, — 3z3 + 3y,(4a + z,)* — Ty?(4a + 23)° 
= —2az, + tay, + 4ay,z, + 3y,22 — 323 — 7y?(4a +22). 


The matrix of coefficients of linear terms of this system is 


a 
- 0 
3 
A= 
=a* —2a 
and its eigenvalues are 4, = a/3 and J, = —2a. For any real a # 0, we find that 
A= A,/4, = —1/6. The singular point is a saddle, and exactly two integral curves 


pass through it. One is the z, axis, and the other is tangent to the eigenvector B, 
of the matrix A. We find that B, = (7,4a) from the equation AB, = 4, B,; that 
is, B, lies along the line z. = 4ay,/7. Thus, the only integral curve which crosses 
the y, axisisz, = 4ay,/7 +--+, ory. = (2/3)a + 4ay,/7 + --: (fig. 73). This curve 
has the two half branches F/(y, > 0) and ¥;(y, < 0). In the original coordinates, 
we have 


= 29,2 47,3 Bie — 4,,2,,3 16,,,,4 ee 
X= gayy + Fay, + ’ X2 = 90° + aay + 


This is the only integral curve which passes through the origin x, = x, = 0; it 
consists of two half branches ¥, and ¥, (shown for a > 0 in figs. 71 and 74). The 
remaining integral curves behave as shown by figure 71. Thus, the general picture 
will be as sketched in figure 74 (for a > 0). 

Note that the truncated system corresponding to the edge [ is sufficient to 
construct figure 74. In fact, the power transformation and the values of y$, 4,, 
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22 


x2 


x 


F) 


Fig. 74 


and A, were all obtained from the coefficients of the truncated system. Only the 
calculation of the first term in the expression of z, in terms of y, along the 
half branches ¥, and ¥, required us to use terms of the original system that 
were not included in the truncated system. And this gave us the second terms in 
the expression of x, and x, in terms of y, along the half branches ¥, and ¥,. 
In our case, these second terms are not important for sketching the picture; the 
behavior of the half branches ¥, and F, is entirely determined by the first terms 


me Fe) ae G28 bie 
xX, =3ayp to, X2 = 90° yi + 


Thus, in this example it would have been sufficient, for the purpose of finding 
the behavior of the integral curves, to perform all of our calculations (i.e. the 
power transformation and finding the root y$) with the truncated system. This 
would significantly simplify computations. Therefore the following method of 
calculation is recommended for investigating the set &{"(e) corresponding to an 
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2 


Fig. 75 


edge J;"”). Instead of the original system, take the corresponding truncated system 
and apply to it the necessary transformations. Only ifit is impossible to determine 
the character of the solutions from the coefficients of the truncated system (ie., 
in case of an edge of type II, and also an edge of type I if 4, = 0 at the singular 
point on the y, axis) should those transformations be carried out with the 
remaining terms of the original system. 


Example 2. 
Xp = X_ + ax? + 5x, x} = x,(xy!x, + ax, + 5x3), 
X_ = —2x? + 3x,x3 — 4x} = x,(—2x}x3! + 3x, x, — 4x3) . 


The open polygon fF (fig. 75) consists of one edge and two vertices. Both vertices 
are of the first type, so that the disposition of the sets Y{(e) and integral curves 
will be just as in example 1. The unit vector along the edge is R = (—2, 1) and 
the truncated system is 


Ky = x,(xy'x, +x),  %,— —2x7'. 


We choose S = (1,0) for the given R and introduce new coordinates y, = x,, 
y2 = Xj]7x2; the inverse transformation is x, = y,, X. = y?y2. The truncated 
system becomes 


Vi =Viv2tayt, yo = —2y, — 2ayy2 — 2yiy3 - 
Letting dt, = y,dt, we obtain 
dy, /dty =V:y2 +4), =9iV1 5 dy,/dt, = —2 — ay, — 2y3 =q, . 
To find the singular points along the y, axis we obtain the equation 


—292 =yitay,+1=0, 
whence we obtain 


a 
(ene sean NG, 
me Name 
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If |a| < 2, then there are no real roots and none of the integral curves in the set 
M¢\)(e) approaches the singular point. Since no such curves exist in the sets %{(e), 
we are faced with the problem of distinguishing between a center and a focus. 
That problem can be solved for this system, but we will not do so here. If |a| > 2, 
there are two real roots. For the sake of discussion, let us choose a = — 4. Then 
y3 = 2 + ./3. The eigenvalues at these points are 


A, =9,(9)=at y8=-24/3<0, 


dy = dga(y2)/dy2 = —2(a + 2y$) = F4,/3 . 


Therefore, for the root y= 2 + ./3, we have 4 = A,/A, > 0, and the singular 
point y, = 0, y$ = 2 + ,/3 is a node. For the other root, yg = 2 — aids we have 
A <0, and the singular point y, = 0, y, = 2 — J3 is a saddle. The behavior of 
the integral curves is sketched in figure 76. Returning to the original variables, 
we obtain figure 77. 

If |a| = 2, the roots coincide. Suppose that a = —2; then yf = 1,4, = —1, 
A, = 0. We let z, = y, — y$ = y, — 1. To investigate the system in the entire 
neighborhood of y, = 0, z, = 0, we must find the normal form. In the complete 
system 


Fig. 76 
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22 


Fig. 78 


dy,/dt, = y, >; higyi'z? , dz,/dt, =z, > hog yf'zP 


all terms have q, > 0, and all of the terms with g, = 0 are obtained from the 
truncated system. On the other hand, A = (— 1,0), so that the normal form 


dw,/dt,; =w, > gigW2, i=1,2 


will only have terms with q, = 0. But according to the remark made after the 
second theorem on the normal form, we will have gio,4,) = hio,q,) here, so that 
the normal form is, in fact, just the truncated system. Thus, investigation of the 
full system in the neighborhood of the singular point y, = 0, y. = 1 reduces to 
considering the truncated system 


dy,/dty; = —y, + ¥122, dz,/dt, = —223 ; 


this integrates to (fig. 78) 


1 1 
In|y,| = =a gill ca Ome 
2 


In x,, X, variables, this gives us figure 77 (with just the single branch F;). 


Example 3. 
X, = 4x3 + x, x. — 2x? = x, (4xy'x, + x2 — 2x4), 
Xp = x3 4+ 2x, x. + 2x} = x,(x. + 2x, + 2x}xz!) . 


The Newton open polygon (fig. 79) consists of three vertices and two edges, 
Tf) > Q4, Q; and FJ” > Q5, G2, O,. Here, R, =(—2,1) and R, = (—1, 1). The 
vertices 1° = Q, and 7{° = Q, are of the first type, and the behavior of the 
integral curves in the sets W@(e) and We) is trivial (fig. 80), but the vertex 


150 Chapter II. A System of Two Differential Equations 


2 


Fig. 80 


I$ = Q, is of the third type, and its set ,&§(e) contains the sets W(s) and 
U''(c), which correspond to the vertex /\° = Q, and to the edge [). For the 
vertex [f° we have R, = (1,0), R* = R2, and A = Fy, =(—2,2). Since the 
scalar products (R,, A> = —2 and ¢R*, A> = 4 have opposite signs, the line L 
intersects the sector ,V at the interior points. This is case d) of section 2.3; the 
integral curves in the curvilinear sectors of the set ,&$(e) behave as in figure 69. 
In order to more accurately define the behavior of the unique integral curve 
which passes through the origin, we make use of the edge J}. We need to 
perform a power transformation and find a fixed point on the vertical axis. 
Instead, we will seek a solution which expresses x, as a power series of x, 
(that is, instead of searching for an analog of the second method of resolving 
algebraic curves, we will employ an analog of the first method). Since R, = 
(—2, 1), then », = 2, and, for the truncated system (corresponding to the edge 
rH”) 
X= —2x7, 9 X= 2x, x2 + 2x? 
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we must find a solution of the form x, = bx?. We write this system as a single 
equation 

dx, xy + x7 

dx, a 


Substituting x, = bx? into this equation, we obtain an equation for b, 2b = 
—(b + 1), whence b = — 1/3. Consequently, the one integral curve which passes 
through the origin in the set ,&§(c) is defined by 


a= JRE 
In order to consider solutions corresponding to the edge I", we make the 
transformation 


Zz _ y-l 
y=, Y2 =X, X2, 


and obtain 
di = yi4yiy3 + vivo — 21) = yil4y3 + y2 — 2), 
Yo = Yo(—4y. 3 + 41 + 2y7y2") = yi yo(—4y3 + 44 yy yo) . 


This is of type I, section 3.3. For the fixed points y, = 0, y, = y$ we obtain the 
equation y3 = 1, hence y9 = +1. The eigenvalues at these points are i, = 2 + 1, 
A, = +6. The point with y3 = 1 is a saddle (one integral curve, y,) =1+-°°, 
passes through it), while the point with y? = —1 is a node (an infinite num- 
ber of integral curves, y, = —1+-::, pass through the point). Consequently, 
there is one curve F, (x, =x, +--+) and an infinite number of curves F, 
(x. = —x, +°*-) (fig. 81). 


Fig. 81 
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Exercise 1. Outline the behavior of the integral curves of the following systems 
near the origin: 
1) X= y—x? + 4y?, 

yp=axyt+3y?+x*,a= —1, -2, -3, —4; 


2) xX = x3, 
poxtt x24 x?y + Qxy st y?; 
3) X = xy, 
pox?—y?; 


4) The system of example 1 with a = 0. 


3.6. Comparison with Frommer’s Method 


Frommer [1928] suggested a method of investigating integral curves in the 
neighborhood of a non-elementary singular point. This method, while it is 
different in terms of technical details, actually has much in common with the 
local method. Frommer used an analog of the Newton open polygon (section 
1.7, Chapter I), reduction of singularities to elementary singular points (similar 
to the first method in section 2.8 of Chapter J), and the division of the neighbor- 
hood of the singular point into curvilinear parts (“Frommer’s regions”). These 
Frommer’s regions are generally analogous to the components of the sets We). 
But Frommer’s method differs from the local method in terms of investigating 
the integral curves inside these regions: specifically, Frommer did not have 
the idea of the normal form. Instead of finding the normal form, Frommer proves 
a series of theorems which establish a connection between the behavior of the 
right-hand sides of the equations of the system in a Frommer’s region (and along 
its boundary) and the behavior of the integral curves in that region. In those 
cases when the disposition of the integral curves is determined by the higher- 
order terms of the normal form, Frommer’s method requires an even more 
detailed investigation of the properties of the right-hand sides. This has produced 
a considerable body of work (see, for example: Nemytskii and Stepanov, 1947; 
Kukles, 1958, 1964; Andreev, 1970; Andreev and Gleiser, 1972), 

Frommer’s method is preferable to the method of the normal form only in 
the following situations: if the right-hand sides in the given system are not 
analytic, but have only a finite number of derivatives, and if the corresponding 
finite Taylor expansions are insufficient for calculating the determining terms in 
the normal form. 

We note that the local method of investigating the neighborhood of non- 
elementary singularity consists of a sequence of algebraic operations. This 
sequence of operations and the corresponding calculations can easily be pro- 
grammed on a computer (see Berezovskaya and Kreitzer, 1975). 


§4. On Distinguishing Between a Center and a Focus 


4.1. Introduction 
Suppose that for a real analytic system 
x; = 0X) ’ (0) = 0 > i= 1, 2 (1) 


there is no set &%{(e) (corresponding to an edge or a vertex of the Newton open 
polygon /°) in which an integral curve approaches the origin. Then the integral 
curves in the neighborhood % must be either closed curves (the case of a center) 
or spirals which wind out of the singular point (a focus). This part of the chapter 
is devoted to a method for distinguishing between these two cases and investi- 
gating their properties. Clearly, we can consider instead of the system (1) a single 
differential equation 

dx, = 92(X) 

dx,  ,(X) © 


(2) 

Let us first consider the simplest case, when the open polygon I consists 
of a single edge with the unit vector R = (—1, 1). Then in the truncated system 

xX; = G(X) ’ i=1,2, (3) 

the @, are homogeneous polynomials of odd degree m. We will consider other 
cases in section 4.7. 

But for most of this part, we will assume that: 

1) the expansions of the ¢; begin with homogeneous forms @,(X) of odd degree 


m, and 
2) system (1) has no real directions of approach, i.e., 


X1 Q(X) — x26,(X) #0 for X 40, XeR?. (4) 


4.2. Trigonometric Theory 
We introduce polar coordinates r, 0: 


x,;=rcos@é , X,=rsin@g. 
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Then system (1) becomes: 
F=@,cos?+ g,sin@ , 


- It 
6 sa sin@ + g,cos@) , 


which is equivalent to the equation 


dr _r(p, cos@ + g, sin 8) 
dd —@,sin@ + cos 8 * 


Condition (4) assures us that the denominator never vanishes for r # 0. Con- 
sequently, this equation can be written in the form 


dr/d0 = f(r,6) = 3° r¥f,(8) , (5) 


k=1 


where the f, are 27-periodic functions, analytic in some region {Im 6| < «. 
We now consider the truncated equation 


dr/dO = rf,(@) . (6) 
Separating variables, we can easily obtain the solution 
r=cexpf f,(0)dé . (7) 


We can find the mean value a of /,(6): 


2n 
ane f f@rao 8) 
then solution (7) can be written in the form 
r=cg(@)expaé , (9) 
where 
g(0) = exp | (10) — add, (10) 


This is a 22-periodic function with no real zeros. 

If a £0, then the solutions (9) are spirals (the origin is a focus); if a = 0, 
the solutions (9) are closed curves (the origin is a center). But all of this was 
established for the truncated equation (6). In order to examine the behavior of 
the complete equation (5), we introduce a “generalized polar radius” ¥ = rg~1(@). 
Then equation (5) takes the form 


d?/d@ = aF + y FF (O) , (i1) 
k=2 
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where a 
AO) =hgr" , k = 2, 3,... (12) 


are analytic, 2x-periodic functions, and the series converges for sufficiently small 
7. We attempt to simplify equation (11) by means of a formal transformation of 
the generalized polar radius: 


F=p+ FY hlOpt=pth, (13) 


where the h,(@) are analytic, 27-periodic functions, though the series may diverge 
for every p # 0. 


Theorem 1. There exists a formal change of the coordinate (13) such that equation 
(11) takes the form 


dp/d@ = ap + y. b,p* = ap + b(p) , (14) 
k=2 


where all the b, vanish if a # 0. 


Proof. Differentiating equation (13) with respect to @ and substituting expressions 
(11) and (14) for the derivatives yields 


& Es ‘ 2 dh 
ar + )) P= (1 +): sae + b(p)) + ¥ p*—* 
\ k=2 v2 6 


Substituting expression (13) for f and collecting terms, we obtain an identity in 
p and @: 


hey hoy vay kh,p* +b knot + Yoh ee 


k=2 


This identity will be satisfied if the coefficients of p* on each side are equal. This 
condition yields the equations 


ah, +{f},=b,+akh,+ Y  bjjh;+dh,/d0 , (15) 


itj=kt1 


where { f}, denotes the coefficient of p* in the series ¥ f,(p + h)'. We let 


={fh- Yo buh; (16) 


itj=k+1 


it is evident that each c,(@) is a 2n-periodic function, being a polynomial in h,, 
b,, and f; with indices j < k. We can then write equation (15) in the form 


dh, /d0 = a(1— kh, — by + & - (17) 
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We must now show that this equation has a periodic solution h, for some value 
of the constant b,. Then we can prove the theorem by induction on k. 
If a = 0, then 
h, = —b,0 +  c,(0)d0 , (18) 


and the function h,(6) will be periodic if we choose for the constant b, the mean 
value of the function c,(8): 


n 


b, = c,(0)d0 (19) 


1 
2n 


oun 


If a # 0, it is more convenient to work with Fourier series. Let 


=e t is (c#, cos 16 + cf* sin 10) , 


hy, = hp + a (hf, cos 10 + hx* sin 10) . 


Equation (17) will be satisfied for b, = 0 if the coefficients of cos /@ and sin /6 are 
the same on both sides of the equation. We thus obtain a system of equations 
for the coefficients h,,: 


a(l—kjhp +cP =0; 
Ihf* =a(l —kK)hi, + ch , 
— thf, = a(1 — he* + c#*, «1 =1,2,... 


For jevery I, the last system has a unique solution since its discriminant is equal 
to a?(k — 1)? + I’. The rates elerowie of the coefficients h}* and hj, with respect 
to [ are like those of c#i* and c%,. The function h,(8) is, therefore, mnalyticl in the 
same region |Im 6| < ¢ in which the function c,(8) is analytic. 1] 


Example 1. Let us calculate the constants b, and b, for equation (15) in the 
case a = 0. According to (16), (12), and (19), we have 


C2 =f, =fog , 
2n 


2n 8 
1 1 
b, = = t,gd0 = ar { Fr Oexp( | S,(0,) a0 .)a : 
0 0 10) 
Similarly, we obtain 


C3 = i nu f,2h, — 2byh2 = fg? + 2ha( fog — bp) , 


2n 


I 
ee ae, 
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where, according to (18) 


h,(0) = | (f2g — b2) 46 . 


o— sf 


4.3. The Study of Normal Forms 


In the last section, we showed that, with the introduction of polar coordinates 
and their subsequent transformation, equation (2) transforms into the normal 
form (14). Let us consider the different cases. 

1) If a 4 0, the normal form (14) is the equation p = ap. This is the case of a 
structurally stable focus; it remains such a focus under any “perturbation” of 
system (1) if the order of the perturbation is greater than m. In this case, the 
normalizing transformation is convergent (see Bruno, 1971, 1972a). 

2) a = 0, b(p) # 0. In this case the solutions of equation (14) are spirals 


dp 1 
@= eo Ee eer ae 
lip" ape 
where b, is the first non-zero coefficient in the series b(p) = >}. b,p*. In theorem 
3 of §2 in Chapter IIT, we will show that with the transformation 


equation (14) can be put into polynomial form: 
dp/[d0 = bp" + baysp"* 


This is the case of a non-structurally stable focus (see sect. 1.11 in Chapter II). 
As a rule, the normalizing transformation diverges in this case (see Bruno, 1971, 
1972a; Osipov, 1973), but there exists a smooth normalizing transformation 
(Bibikov, 1971, 1973; Tokarev, 1977). 

3) If a = 0 and b(p) = 0, the solutions of equation (14) are concentric circles 
p = const. That is, the singular point is a center. The normalizing transformation 
converges. 

Note that the transformation from cartesian coordinates x,, x, to polar 
coordinates r, 8 blows up the singular point X = O into the invariant circle r = 0. 
After such blowing up, we applied the method of studying solutions near an 
invariant closed solution without singular points. The specific character of the 
original system (1) was thereby lost. 


4.4. On the Computability of the Coefficients of the Normal Form 


This aspect is manifested, in particular, by the fact that the functions /,(0) in 
equation (5) are rational functions of sin # and cos @; they are, in fact, homo- 
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geneous functions of degree k — 1 of these arguments. Consequently, and in 
agreement with (8), the value of a will be an algebraic function of the coefficients 
of the polynomials ¢; (see Khinchin, 1955, § 65). But the values of the b, (if a = 0) 
will in this case be transcendental functions of those coefficients. For example, 
consider the system 


Xj = G(X) + W(X) , i=1,2, (20) 


where the @; and , are homogeneous polynomials of degree m and m + 2, 
respectively, and a = 0. Hence, f, = 0 in expansion (5). Then, according to the 
formulas of example 1, we have 


bs eeSe, bad. 


1 2 : her ( Oo. (21) 
braze | hatdoms | ftoyexe(2 | 440s, )a 3 


Here the coefficients of the polynomials @; are included in f,; and appear in the 
exponents of e; that is, they do not enter into formula (21) in an algebraic manner. 
Specifically, the conditions for a center (b, = 0, k = 2,3,...) are not algebraic 
conditions on the coefficients of the original system (1). Recall that for an 
elementary singular point the coefficients of the normal form are polynomials in 
the coefficients of the original system, and that the conditions for a center are, 
in that case, algebraic conditions on those coefficients. 

In this connection, we note the assertion of Arnol’d [1970] that the case of a 
center is always distinguished by algebraic conditions on the coefficients of the 
original system (1). In light of the above, we see that this assertion is correct for 
elementary singular points but is not true for non-elementary ones. I!’yashenko 
[1972] gave a rigorous proof of the non-algebraic nature of the conditions for a 
center. He considered a system like (20) where m = 3 and the coefficients of the 
forms @, and @, formed a five-parameter family. 

The conditions of a center are non-algebraic only for coefficients of the forms 
@, and @,. If these coefficients are fixed, then the conditions for a center are 
algebraic for the remaining coefficients of system (1), since they do not appear in 
the exponent in formula (21) (see I’ yashenko, 1972). Apparently, this property 
also holds in the case when the coefficients of the forms @; are not fixed but belong 
to a family with a small number of parameters. It would be interesting to find 
an exact bound on the number of parameters under which the conditions for a 
center remain algebraic. Il’yashenko’s proof tells us only that this bound is no 
greater than four. Perhaps it is even less. 


4.5. Integration of the Truncated System (3) and Computation of the 
Constant a 


If we take u = x{!x, then system (3) is equivalent to the system 


(In’x,) = xf’ '@,(1,u) ? 
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(In'u) = x7 *[G2(1,u)u* — O,(1,u)] « 
This system corresponds to the equation 


dinx, | @,(1,u) 
du G2(1, u) — (1, uu 


= d(u) . (22) 


Thanks to condition (4), the denominator of the right-hand side never vanishes 
for real u, and it is a polynomial of degree m + 1. The degree of the numerator 
never exceeds m. If we express the right-hand side of the equation (22) as a sum 
of simple fractions we obtain 


Ly = 
5 Xe Hee \ 
8(u) = Sih as 
Z op) (aes, va 


where , and 8, are the roots of the equation 


@2(1,u) — ug; (1,u) = 0 


of multiplicity /,, such that 21, = m + 1. We will assume that Im f, > 0. From 
(22), we have 
In x, = J d(u)du , 


where the integral is taken along the real axis. Using expression (23), it is easy 
to express this integral as a sum of elementary functions: logarithms, arctangents, 
and rational fractions (see Khinchin, 1955, §§ 60, 61). Because of its complexity, 
we will not write out the integral in the general case here. We will only calculate 
the change in the value of x, along an integral curve in a single circuit about the 
origin, X = 0. One circuit around this point corresponds to two integrations over 
the real axis in the complex plane of u. Consequently, 


Inx, —Inx? = 2 f d(u)du = 2n , 
Pi 


where the integral represents a Cauchy integral and y is a closed contour 
corresponding to the real axis. According to the theory of analytic functions, the 
evaluation of such an integral reduces to finding the residues of the function d(u) 
(see Privalov, 1960, Ch. 7). In agreement with (23) 


2 | 6(u)du = 201 Y (G1 — G1) = —4n Y Ima, . 
Y k k 
Therefore 
a= -2¥ Ima, : (24) 
k 
Example 2, Consider the homogeneous system 
x, = —X1XZ-—XZ=Q, 


(25) 


X= wx? + (1 + 07)xyx3 —xXZ HQ . 
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Condition (4) is satisfied since the polynomial 


X1 Pz — X2Qy = wW?xt + (1 + w?)xFx5 + x9 


has no real roots. Equation (22) is 


dinx, -w—uWw 


duu (1 + w?)u? + w? = sur (26) 


The roots of the denominator are B, = i = B;,8, = iw = By. fw # +1, all roots 
are simple. Let us assume that 


oa>00F1; (27) 
then expression (23) is 
du) =— 2 + eile eee az 
u-i uti u-iwo utio 


By writing this as a single fraction, with the same denominator as that in 
expression (26), and then equating the coefficients of like terms in the numerators 
of the two fractions, we can obtain a system of linear equations for «,, @,, a, and 
@,. Solving this system yields 


Then the integral of equation (26) is 


Inx, = Rea, In(1 + u?) — Ima, arctgu 
+ Rea, In(1 + w 2u?) — Ime, arctg +e. 
0) 


In agreement with (24), we have 


1 @ 
a= —2(Ima, + Ima.) = 2( + ) 


w—l 2 


r re atl” 


Consequently, by condition (27), the integral curves of system (25) form a focus. 
Theorem 1 informs us that it will remain a focus, even if terms of order greater 
than 3 are inserted into the right-hand sides of expressions (25). 


Exercise 1. Calculate the value of a for system (25) if @ = 1 and evaluate the 
integral explicitly. 
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4.6. Normalization in Complex Coordinates 


We introduce complex coordinates 


2, =X, + ix, , Zz =X, —ix,, (28) 


1 
1 = 5 (21 + 22) , x2 = 


Thus, the complex-conjugate coordinates z, = Z, correspond to the real co- 
ordinates x,, X,. System (1) takes the form 
2, = Q + ig, = (21,22) , 
(29) 
22 = 1 — iP, = O(22,2;) - 
We make the power transformation 
v=2,2,'. (30) 
and system (29) becomes the system 
Z, = (21,20) , 


b= v(Z2'@(Z2,21) _ 9(Z1,22)Z3") ’ 


which corresponds to the equation 


dinz, vp . 2 k 
ch ee , 31 
dinv — @(2,v,2,) — vp(z1,2,0) Dor e) 


where the ¢, = )/2_., €,v' are Laurent series, and the exponents k, / are integers. 
This equation can be put into normal form with a coordinate change of the form 


0 2) +0 
z=w > owt=w Yowk YC! (32) 
k=0 K=0 1S 
Under transformations (28) and (30), the singular point X = 0 is blown up into 
the unit circle |v| = 1,z, = 0, and a circuit about the point X = 0 in the RZ plane 
corresponds to a double traverse of this circle. The Laurent series €,(v) and ¢,(v) 
converge in some ring 1 —e< |v] <1 +e. 
The complex coordinates can be associated with polar coordinates in the 
following manner: z, = re", v = e~?'®, The Laurent series in v are just Fourier 
series in 20. For the truncated equation we therefore have 


dinz, dinr + id@ _ jdinr ee Mi Bact Ho) 
dine 2. ae 
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where g(v) is a Laurent series with no constant term. In the standard manner, 
we can show that under the transformation (32), equation (31) is carried into the 


normal form 
foo} 


one oo Spun! 


=0 [=-0a 


consisting solely of resonant terms, for which 
kA+1=0, (33) 


where A = —4 + $i (see Bruno, 1971, 1972a, § 11; 1970b). 
Ifa #0, then the only real solution of equation (33) is the trivial one k = | = 0, 
and the normal form is linear: 


dinw _ 
dinv 


Ifa = 0, then integral solutions of equation (33) have the form k = 21, and the 
normal form is 


dinw 1 
= —.+ Y yv'w?! (34) 
- dinv 2 2 
r (for p = w/v) : 
dinp 
an Yue” . (34’) 
t=1 


The reality relations for coordinates v, z, are =v ' 


conditions for equation (31) are 


,Z, = vz,.So the reality 


Coa Cop = el ert Cen = 0, K+ | #0. 


As in theorem 3 of section 1.9, we can prove the existence of the normalizing 
transformation (32) with ¢, , = ¢,.,-, which preserves both the reality relations 
and the reality conditions. That is, in the normal form (34) all the coefficients y, 
are pure imaginary. Since Inv = —2i6, then the normal form (34’) must coincide 


with the normal form (14). Hence, in the normal form (14) 


"he 0, k even , 
Mt iy iih vg OGM « 


That is, the right-hand part of equation (14) contains only odd powers of p. 


4.7. Classification 


We will confine ourselves here to the following classification of the cases in 
which the problem of distinguishing between a center and a focus arises for 
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system (1): 

1. The Newton open polygon / consists of a single edge J") with unit vector 
R = (r,,1rz), where both r, and r, are odd integers. 

2. The open polygon /’contains only one edge 1”), the unit vector R of which 
has an even component. 

3. The open polygon / has more than one edge. 

We consider these cases in order below. 

1. After the introduction of new coordinates y,, y, according to the formula 


ios, —_ ym 
Xi; =i > X2=)2 >, 


where m = |r,| and n = |r,|, and the change of time variable dt = y7"'y3"? dr, 
system (1) becomes 
dy,/dt = y2@,(yt,y2) 


dy,/dt = yt '@,(yj,y2) - 


The right-hand sides of the truncated form of this system are homogeneous 
polynomials, and condition (4) is satisfied for this system. The analysis of such 
systems is given in previous sections. Lyapunov [1935b, sects. 9-20] examined 
some subcases with R = (n, — 1), introducing new special functions. 

2. In this case, the truncated system always has a center since its integral 
curves are symmetric with respect to one of the coordinate axes. Indeed, if r, is 
even, the truncated system transforms into itself under a change of the sign of 
x,. This case reduces to the case examined in sections 4.1-4.3 under the coor- 
dinate change 


(eager ee ht a 
2(v + ./u? + v2)" 


where m =|r,| and n =|r,|. Switching to polar coordinate r, 0 (u = rcos 6, 
v = rsin@), we obtain an equation (5) which has all the necessary properties. By 
theorem |, it can be transformed into a normal form with a = 0. 

Lyapunov [1935b, sects. 9-20] again considered some subcases with R = 
(n, — 1) using special functions. In particular, in section 19 he calculated the 
value of b, in the normal form. It turns out that b, depends algebraically on the 
coefficients of the right-hand sides. Apparently, this is a consequence of the fact 
that the coefficients of the truncation depend only on one parameter. 

3. At this time, no example of this case has been studied. The general approach 
of the local method, consisting of solving system (1) separately in each set UE) 
and the subsequent “sewing together”, can give an answer here. But questions of 
sewing the parts of the solutions from the different sets W{(e) have not been 
sufficiently worked out (see section 3.4). This case to some degree resembles the 
problem of investigating the neighborhood of a closed integral curve containing 
one or several singular points. The investigation of this situation is an important 


xX,;=u", X» 
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problem of the theory of differential equations. Similar problems, with a large 
number of variables, arise in celestial mechanics. As an example, we can mention 
the determination of periodic solutions of the second kind in the restricted 
three-body problem. 


Remark. There is a large literature in which conditions for a center are derived 
for these and other particular cases (see, for example, Sadovsky, 1976, 1978). 


Exercise 2, Explain whether a focus or a center results when we take |a| < 2 
in example 2 of section 3.5. 


4.8. The Neighborhood of a Periodic Solution 


Let the system 
X; = OAX1,X2) , i= 1,2 (35) 


be analytic in some region Y, and suppose there is a periodic solution ./@ of period 
T 
x, =x) =x(t+7) , i=1,2, 


where the functions y; are analytic. In a tubular neighborhood of the periodic 
solution .@ we introduce new coordinates r, t by the formulas 


Xx, = x10) — ro2(x1 (0), X2(0)) , 
(36) 


X2 = X2(t) + 7A (11 (0, 429) - 


Here r measures the distance from the point x,, x2 to the periodic solution .# 
along a normal; if r = 0, then x,, x, is on 4. Transformation (36) is one-to-one 
in the tubular neighborhood |r| < ¢ for some ¢ > 0. In fact, the Jacobian of 
transformation (36) is given by 


A= ~9} — 03 — (G19. — G20) 


where ¢; and @; are calculated on .@. As long as there are no stationary points, 
then vy? + o? 4 Oand the function |@, 9, — @2¢,| is bounded from above. Thus, 
4 #Ofor 

|r| < min(g7 + ¢3)/max|, 92 ~ $29| - 


After inserting (36) in system (35) we obtain an equation for r: 
F=ro(r,t)=ro(r,t +1) . (37) 


After the substitution @ = 2zt/t, equation (37) takes the form (5). All of the results 
of sections 4.2 and 4.3 apply; in particular, theorem 1 applies. If a # 0 in the 
normal form (14), the periodic solution is a structurally stable limit cycle which 
is stable if a < 0 and unstable if a > 0. If a = 0, but b(p) $ 0, then the periodic 
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solution .#@ is a non-structurally stable limit cycle. Its stability is determined by 
an index / and by the sign of the first non-vanishing coefficient b, of the series 
b(p) = bp' + ---. If lis even, the limit cycle @ is unstable; if / is odd and b, > 0, 
it is again unstable; if, however, / is odd and b, < 0, then it is stable. In this setting 
the limit cycle is said to have multiplicity / — 1 since no more than / — 1 limit 
cycles can be generated by small changes in the equations. Finally, if a = 0 and 
b(p) = 0, then there are other periodic solutions near .@. All of these are the level 
curves of some analytic integral. 

These results follow from the arguments in section 4.3. We note that the 
coordinates r, t or r, 6 are “local coordinates for the periodic solution .@”. Local 
coordinates are introduced for the general case in section 4.1 of Chapter ITI. 


Chapter III 
The Normal Form of a System of n 
Differential Equations 


§1. The Normalizing Transformation 


1.1. Statement of the Problem 


In this chapter, we will examine an autonomous system of ordinary differen- 
tial equations in the neighborhood of a stationary point. By means of a parallel 
translation, this point can always be taken as the origin of coordinates. Thus, 


our system is 
X = AX + G(X) , (1) 


where X = (x,,X2,...,X,) is an n-dimensional vector variable, A is a square 
matrix of order n, and © is a vector function analytic at the origin X = 0, the 
Taylor series of which contains neither constant nor linear terms. Generally 
speaking, the original coordinates X are not the most convenient for investi- 
gating the properties of system (1) near the fixed point X = 0. The question then 
arises: is it possible to simplify system (1) through an invertible change of 
coordinates which maps the stationary point onto itself? That is, what is the 
simplest system 

Y=CY+ WY) (2) 


to which system (1) can be transformed by a change of coordinates 
X=BY+4(Y), (3) 


which is invertible at the stationary point X = 0 and maps it onto the point 
Y = 0? In this context, we can take transformation (3) in one of several classes: 
formal, analytic, smooth, etc. We will consider here only formal and analytic 
coordinate changes (3), where B is a non-singular matrix and 5 consists of power 
series with no constant or linear terms. Let us clarify this. Every function f(X) 
which is analytic at X = 0 can be expressed as a Taylor series 


F(X) = Y fox? , (3’) 


where Q = (q1,42,--.54,) and X2=x#?!...x%", This series converges ab- 
solutely in some neighborhood |x;| < p (i = 1,...,n), and, in this neighborhood, 
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the value of the sum coincides with the functional value f(X). The coefficients of 
this series satisfy an upper estimate 


fol <p tree ay (4) 


where c is some constant. If the coefficients of the series (3’) do not satisfy such 
an estimate (4) for some positive c and p, then the series diverges in every 
neighborhood of X = 0, and, generally speaking, it is not clear to what function 
the series can be said to correspond. Power series, for which we make no 
assumptions about their convergence, are called formal series. We often turn to 
formal series for help in finding the analytic solutions of these and other prob- 
lems. We begin by finding a formal solution in the form of a power series, and 
then determine the region in which this series converges and, hence, defines an 
analytic function. Finding solutions in the class of formal series can be considered 
to be an independent problem. Even in cases when these series diverge, they may 
be the Taylor series of infinitely differentiable functions. Arithmetic operations, 
differentiation, and the substitution of one series in another are all defined for 
formal series, just as for convergent series. 

Thus, in § 1 of the chapter we will consider the problem of simplifying system 
(1) with the help of formal changes of coordinates (3). Later (in §3), we will 
investigate the convergence of these transformations. 

Before we try to simplify the non-linear parts of system (1), let us simplify the 
linear part with a linear change of coordinates: 


X = BY. (5) 


1.2, Linear Systems 


First, let us briefly recall what we know about Jordan matrices. A Jordan 
block is a square matrix of the form 


fae 0° Onc 20. 008 
¢ 4,0 .. 0 0 
0 ci% .. 0 0), 
\o 0 0 oy, 


where each element along the principal diagonal is the same number 4, and the 
elements immediately below the diagonal are all equal to o # 0; all other ele- 
ments are zeros. A Jordan matrix is a block-diagonal square matrix 


in which the Jordan blocks J,, ..., J, lie along the principal diagonal and all 
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elements outside of these blocks vanish. The number o is the same in all the 
blocks. 

As we know (see Gantmacher, 1959), any matrix A is similar to some Jordan 
matrix J; that is, there exists a non-singular matrix B such that J = B7~!AB. The 
matrix J is called the Jordan normal form of A; the numbers along the main 
diagonal of T are the eigenvalues of A, 4,,..., 4,, and o is an arbitrary non-zero 
number. 

Every Jordan matrix C = (c,;) has the following two properties: 

a) it is lower-triangular; ie., c,, = 0 if i < j; 

b) c;; = 0 if A; # A;, where A, = c,, are the elements of the main diagonal. 
Hence, every matrix A is similar to a matrix C with these properties. 

Note that transformation (5) takes system (1) into system (2), for which the 
matrix C is similar to A: 

C=B'AB. 


It is therefore always possible to transform system (1) into system (2), where the 
matrix C possesses properties a) and b). In our further discussions, we will assume 
that we have already performed a linear transformation such that the matrix A 
in system (1) has these properties. 


1.3. The Existence of the Normalizing Transformation 


We will now investigate system (1) by using formal nonlinear changes of 
coordinates (3), where B is the identity matrix. Such a coordinate change will 
transform system (1) into system (2), where C # A. By A = (A,,A3,...,4,) let us 
denote the vector of the elements of the main diagonal of A. We set 


YW = yi9glY) = Vj > 9jioX? , j=l,...,n, (6) 
QeN,; 


where Q = (q,,42,---,4,) and Y@ = y?#... y4". Since y,g; are power series with 
no constant or linear terms, then 


N; = {integral Q:q,; > —1, q, > 0(k # j), |}Qi] > 0} , 


where ||Q|| = q, + q. ++: + q,. We will write N = N, U-:-UN,,. 
Formal system (2) is called a normal form if 
1) the matrix C possesses properties a) and b); and 
2) in the notation of (6), all the g;g = 0 for <Q, A> # 9, that is 


W=y, Y ge’, P= Vyeiay Bu 
<@,A>=0 
Let us call those terms yj,gjo Y2 in expression (6) for which <Q, A> = 0 the 
resonant terms. Thus, a normal form is a system (2), where the nonlinear part of 
the right hand side consists solely of resonant terms. 
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For the remainder of this chapter, we will employ the following notation: 
p(X) = xifi = xd) fioX® , 
WAY) = vidi = Vi DL GigY® . 
E(Y) = yihi =y; Vig ¥? , 
ni(Z) = 2d; = 2; > digZ® . 


Symbols from one side of those equations will be carried over to the other side, 
€.2., Wi = yigi, etc. Also we will write 


Fy = (fige--«>fng) » F = (fisrosda) = 2 FoX? 5 


and we define G and H similarly. We will also use ¢;, for the original system (unless 
noted to the contrary), W; for the normal form, and €; for the normalizing 
transformation. 


Theorem 1. There exists a formal transformation 
x= y+ E&(Y), i=1,...,n (7) 
under which the formal system 


X= Aix; + Y ayx; + OX) , i=1,...,n (8) 
i>j 


becomes a formal system 


Vix Ay + ¥ ayy; + W(Y) > i=1,...,n, (9) 
i>j 
such that gig = 0 if <Q, A> #0. Here the coefficients hig for <Q, A> = 0 may be 
given arbitrarily; then the remaining hig and the gig are uniquely defined. The ¢;, 
@;, and W; are power series with no terms of less than second order. 


Proof. Transformation (7) carries (8) into (9) if the following equations for 
formal series in Y are satisfied: 


v O(y; + €i) 


[ Oy; 
jal Yj i. j>k 


(ay, ar y Gin Va + v,) = diy, + Aisi t >. ayy; + y, aiySj 
: ij Pj 


+ @o(Y + &), i= 1l,....n. 


These equations are obtained from system (8) by expressing X in terms of Y 
according to formula (7) and substituting for j; according to formula (9). Col- 
lecting and rearranging terms, we obtain 
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“Oh; “1 Gh; 
Vidi t yi) s—AY= hi > Guy) — hiviGi— Vi) a Dany, 
2 oy; SPI » Teel 2 by; S. rik Ik 


i>j jrk 
*, oh; (10) 
—y, D ay, 29 + 2 a,jy;h; 


+ OAV FV... Vn + Val) i= l,...jn. 
Writing out the coefficients of y, Y@ in the i-th equation (10), we get 
Jig + hig<Q, A> 


mt y hig-r,+£,%j 7 \: hipGir 
j P+R=Q 


i>j 


= > Y hio-n,+8,(9j + Yay — » hip > PjGir 


j=l j>k P+R=Q 


+ >, Ajhjg-£,+6, + {eto , Qe N; ; t= lyin 
i>j 


Here (and later), E, denotes the j-th unit vector, and {@;}g is the coefficient 
of y,Y® in the series g,(Y + =). The system of equations (11) is equivalent to 
system (10), since, when Q runs through the set N,, the product y,Y® runs 
through all products of non-negative powers of y,,..., Vq- 

The set of n-dimensional real vectors can be totally ordered accordingly: a 
vector P precedes a vector Q if the first non-zero difference from the following 
set: Ol — IPI], 41 — Pi, 42 — Pas -+-> In-1 — Pn-1, iS positive (recall that ||Q|| = 
4; + °°: + q,). Clearly, only a finite number of the vectors in N can precede any 
Q EN. It is easy to see that, in the right-hand side of (11), only those h;p and gj 
appear for which vectors P and R precede the vector Q. This is correct for the 
Ist, 3rd, and 5th summations in the right-hand side of (11), since the vector 
OQ — E, + E; precedes the vector Q if j < i; it is true for the second and fourth 
summations because only those P and R are used for which |/P|| + ||R]| = ||Q|l 
and ||P|j, ||R|| > 0; hence ||P||, ||R!| < |Q|). Finally, {¢,}9 contains only those h,p 
that ||P|| < ||Q|l, since the series ~,(X) have no linear terms. 

Equations (11) are satisfied when ||Q|| = 0, since equations (10) have no linear 
terms. For ||Q|| > 0, equations (11) will be satisfied if we choose 


Jig=9, hig = <0, A>“! cig for <Q,4> £0 ; 
Gio = Gig » hig — arbitrary for <Q, A> =0 ; QeEN, , Pade: ceo 


Here, cjg denote the right-hand sides of equations (11). Thus, the gig and hig 
(i = 1,...,n) are defined in the above ordering of Q in correspondence with the 
assertion of the theorem. FJ 
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A normal form can be found even for non-autonomous systems. If the time 
dependence is periodic or quasi-periodic, then the transformation of the system 
to normal form corresponds to the transformation of an autonomous system to 
normal form in the neighborhood of a periodic solution or a family of condi- 
tionally periodic solutions (see §§ 4 of chapters HI and IIT and Bruno, 1971, 1972a, 
§ 11). Kostin [1973, 1975] and Sibuya [1958] have treated the normalization of 
general non-autonomous systems. 


1.4. Transformations of the Normal Form 


As theorem | suggests, the normal form is not uniquely determined by the 
original system. Suppose that some system (8) is carried by one transformation 
to normal form (9) and by another transformation to the normal form 


2, = Az, + Dayz, + P(Z), i=l,....n. (12) 


i>j 


Then the superposition of one transformation with the inverse of the other will 
take normal form (9) into normal form (12). Therefore, if we study all transforma- 
tions of this last type, we will also know all the transformations of the original 
system to normal forms. 


Theorem 2. If, under the transformation 
yi =n(Z) , i=l,...,n (13) 


normal form (9) is transformed into normal form (12), then dig = 0 for <Q, A> #0 . 


Proof. The proof is similar to the proof of the preceding theorem. In particu- 
lar, we obtain this system of equations connecting g, g, and d: 


dig<Q, A> ar Y dig-£,+£,4y a} Y dipGir 
i>j P+R=Q 


i>j 


= d, x dig, +6,(9; + 1)aj, 
J71 i (14) 


P+R=Q j=1 Ve i>j 
+ Wiso, QeEN, , aed eee (ae 


Just as in the preceding proof, we introduce a total ordering of the vector indices, 
and we note that, in the right-hand side of (14), only those d;p appear for which 
P precedes Q. By induction on the vector indices, we shall show that 


dp=0, if{P,A) #0. (15) 
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The smallest P e N, according to the ordering used here, is P = E, — E,. This 
vector appears only in N, and does not appear in the remaining N;. The right 
hand side of (14) vanishes for this vector index, and these equations take the form 
dig,-2,<E, — E:1,A> = 90. This implies (15) for the smallest P. Now let (15) be 
satisfied for all P preceding some vector Qe N, and let <Q, A> #0. We will 
show that the right-hand sides of the equations (14) vanish in this case. Since the 
matrices of the linear parts of systems (8) and (13) possess property b), then a, # 
O only if A; = 4,, (that is, (E;,4> = <(E,,4>) but then (Q — FE, + E,, A> = 
<Q, A> # 0 and djo_¢,+2, = 0 by our inductive assumption. Therefore the first, 
third, and fifth summations in the right-hand sides of (14) must vanish. If P + 
R = Q, the product dipgjz = 0, since <P, A> + (R, A> = (Q, A> # 0; Le., at least 
one summation in the last sum is non-zero and, consequently, at least one 
of the factors in the product d;pg;z is zero. Therefore the second and fourth sum- 
mations in (14) vanish. It remains to be shown that the coefficient of y, Y2 in the 
series 


Vidig(V141)--->Vndn) = Vid; > Gis dp indy 
ry 
Si DG dade i,20, i=1,...,n. 


vanishes. From the last expression, it should be obvious that this coefficient is a 
sum of all terms such as 
Gis4 1p, dap. > (16) 


for which S + P, +--+ P, = Q. Since (Q, A> # Oand <S, A> = 0, then (P;, A> # 
0 for at least one P,. In product (16) all the vectors P; precede Q and, according to 
our inductive assumption, the corresponding factor in (16) vanishes. This means 
that the term {;}g also vanishes, and we obtain djg(Q, A> = 0. Hence, dig = 0 
f<Q,4> #0.0 


Example 1. The following question arises in connection with Theorem 2. Is 
it possible to simplify further the normal form by means of the transformations 
of theorem 2? We show here that it is not always possible. 

The system 


(17) 
Xn = mx, + W(%4,-.-5Xq-1) ’ 


where w is a homogeneous polynomial of degree m > 1, is a normal form. Here 
A =(1,1,...,1,m), and the equation 


(0,40 =4,+°° +4-1+4,m =90 (18) 


has exactly two kinds of solutions Q € N: 
Iq, =0,Q0 = E; — £;,i,j = | Frees 1 
TN. gy = 1,91 +42 +0 + Gnd = Ii Gna 2 O. 
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Writing system (17) in form (8): 


X; = 1x; , i=l,...,.n-—1, 


A \ 
2 5 = 
a ha a ee ee as 


we see that (18) is satisfied. According to theorem 2, all transformations which 
carry system (11) into other normal forms have the form 


j=l (19) 
Xn = CV, * C(Vis+++>Yn-1) > 


where c det(c,;) # 0 and € is a homogeneous polynomial of degree m. Under such 
a transformation system (17) becomes the system 


V=)i 5 VHA) cee hs 
jp, = cX, — cE = cx, + ow — come 
(20) 
= my, to lmE + c'W(x1,...,X_-1) — @ ime 


= my, + W(V1s-++sYn-1) : 


We have used here Euler’s formula for the homogeneous function ¢. System (20) 
differs from (17) only in the homogeneous polynomial 


WY 5--+5Yn-1) = CTW(X1,---.Xn-1) » 


which is obtained from the polynomial # by the linear transformation 
n-1 
x =o" Y cyy, i=l,...,n—-1. 
J=1 


Hence, the problem of determining the complete system of invariants of system 
(17) with respect to formal transformations reduces to the problem of finding the 
complete system of invariants of an (n — 1)-dimensional form of degree m with 
respect to linear transformations. The same question arises in the reduction to 
canonical form. Considering how little is known about algebraic invariants in 
the general case (see Gurevit , 1948), theorem 1 appears to be the best way to 
obtain general results on the simplification of system (8) with formal transforma- 
tions. This is why we introduced normal forms, which give neither a full classifica- 
tion of systems nor a complete system of invariants; their normality appears in 
the normality of the vectors Q and A. 

By constructing similar examples it is possible to show that the problem of 
determining the invariants and the classification of formal systems in the class 
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of formal] transformations includes fundamental problems of the theory of alge- 
braic invariants. But in some special cases it is possible to obtain further simplifi- 
cations of the normal form, and even to find all of its invariants. This will be 
treated in § 2 of this chapter. 


1.5. Distinguished Transformations 


As theorem | states, the normalizing transformation (7) and the normal form 
(9) are uniquely determined only after the choice of the resonant coefficients h,o 
(for <Q, A> = 0) in the normalizing transformation. In particular, all of the 
resonant Coefficients h,g can be set equal to zero. Such a normalizing transforma- 
tion (9) is unique; we will call it distinguished. It turns out that many properties 
of the original system are carried over into the normal form if the resonant 
coefficients are chosen in the proper manner. There will be many examples of 
this in this chapter. Here, let us show that the “triangularity” of a system can be 
preserved under a normalizing transformation. 


Theorem 3. Let the series Q,, ..., Qm in System (8) depend only on x1, ..., Xm 
(m <n). Then the series €,,..., €,, in the distinguished normalizing transformation 
(9) are independent of y,,+1,---> Yn, and the transformation 


X= Vet CVise-s Vm) i=1,...,m 
takes the subsystem 


X= Aix, + YS ayxj + P%15---.Xm) i=1,...,m 
i>j 
into normal form. 


Proof. The proof is a continuation of the proof of theorem 1. The condition 
of the theorem implies that 


fig=9,  ifldmealto+lal#0, i=1,...,m; 
hg=0, if€Q,4>=0, i=1,...,m. 


We must show that hp = g;p = 0, if [pai] + °°: + |p,| #0,i = 1,..., m. Let this 
be true for all P which precede the vector Q. We will show this for all Q such that 
\@m+il + °°: +1q,| #0. We will successively show that all summations in the 
right-hand side of equation (11) vanish for i < m. 

1) Since i < m, then the vector P = Q — E, + E; has 


Pu= Wk > kK=m+1,....n, 


and, by our inductive assumption hig-£,+8, = 0. 
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2) P+ R=Q, whence [Pmsi| t°°* + [Pal + [marl #07 + Ital 2 ldmaal +7 
+ |q,| > 0; then either |p,,4,| + °°: + Ip,| #0 or Irnei| + °° + [rl # 0, or both. 
In any case, h;pgip = 0 by our inductive assumption. 

3) For j < m, just as in 1), hyg_z,+2, = 0. For j > m, the jth component of the 
vector P = Q — E, + E,is p; = q; + 1. 1fp; #0, then |p,,4,| + °°* + |p,l # 0, and 
by our inductive assumption hig_z,42, = 0. Thus, (4; + !)hig-z,+2, will always 
vanish for j = 1,..., n. 


4) See 2). If Past ="-° = Pa =O then Oa PrGkr = py a Pe9cx, but then 
Itmeil too +r, #0 and g.p =" =9nr =O by our inductive assumption. 
And if |Pmiil +°°* + |Pal #9, then hip = 0. 

5) See 1). 


6) We are left with a sum of terms of the form 
{1 + hfs YSU + Ay)... + Ay) Yo (20’) 


If [Sno1l1 + °° + |s,| 40, then f,, = 0 by the condition of the theorem, and such 
terms vanish. If s,,., =°': = s, = 0, then (20’) is 


{(1 + hi) fis YS + hy... (1 + hy} 5 
which is a sum of terms of the form 
Fish, por. hj poo 5 fissisodg Se ™ , 


where Q = S + PO) +--- + P. Hence, 


k 
l@mail #7** + lanl < © (RAG Fe BR 
t=1 


Therefore (p®,,| +--+: + |p?| #0 for at least one P®, and hj, p. vanishes by 
inductive assumption; that is, the terms (20’) vanish. Consequently, all the terms 
in {(1 + h,)f;}9 are equal to zero. Thus, we have shown that 


gig t hig6Q,A>=0, i=1,...,m. 


If (<Q, A> 4 0 then gig = 0 by the definition of the normal form, and we see that 
hig = 0. If <Q, A> = 0, then hig = 0 by the condition of the theorem, which 
implies that gig = 0. The induction begins with the linear terms, for which the 
assertion of the theorem is obvious. 1 


Remark. At the same time, we have proven a slightly broader statement than 
that of the theorem. Specifically: Let the series g,, ..., 9,, in the situation of 
theorem | depend only on x,,..., X,,, and let the resonant part of the normalizing 
transformation have similar properties (i.¢., the resonant coefficients hjg vanish 
ifi< mand q,.4,; ++: + 4, > 0). Then the series €,,..., ¢,, and W,,..., W,, are 
independent of y,,415---5 Yar 
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1.6. Small Parameters 


Suppose for the system 
Xi = P(E X) , i=1,...,0 (21) 


with parameters E = (é,,...,€,,), that the point E=0, X =0 is a stationary 
point. Let A =(A,,...,4,) be the vector of eigenvalues of the matrix 60/0X, 
evaluated at £E = 0, X = 0, and let the functions ¢, be analytic at this point, ice., 
they may be expanded into convergent power series in E and X. 


Theorem 4, There exists a coordinate change 
x, = (EY), i=l,....n, (22) 
invertible at the point E = 0, Y = 0, which takes system (21) into the normal form 


W=WEY=y YL gQlEY?, ben. (23) 
<Q,A>=0 


Here the ¢; and w; are formal power series in E and Y, and the gig are power series 
in E. 


Proof. Let us consider the small parameters as variables and add to system 
(21) the system 
é&=0, j=l,...jm. (24) 


In this context, the ¢; can be conveniently treated as variables preceding the 
variables x;. For the combined systems (24) and (21), the eigenvalues of the matrix 
of linear terms evaluated at £ = 0, X = Oare0,...,0,4,,...,A,. We can explicitly 
write the linear part of the combined system: 


E=0, X=WE+AX+-:, (25) 


where the rectangular (n x m)-matrix JJ and the square (n x n)-matrix A both 
have fixed coefficients. A linear transformation in the X coordinates alone: 


X = BZ (26) 


can be used to put the matrix A into Jordan form J = B-'AB. We can index the 
Z variables in such a way that all vanishing eigenvalues A, = --: = A, = 0 appear 
first on the main diagonal of J, followed by the non-zero eigenvalues J,,,,..., 
A,. Then J = {Jo,J,} is a block-diagonal matrix in which all vanishing diagonal 
elements are found in Jy, and all non-vanishing diagonal elements are in J,. If 
we split the vector Z into two vectors, Z) = (2,,...,2;) and Z, = (Z;41,.--52,)s 
then system (25) takes the form 
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E=0, 
Zo = ME+JoZo+°"°. (27) 
Z=ME+ NZ, 4° =SU'ME+Z)+-° 
If we let 
W=Zo, W,=J(M,E4+Z, . (28) 
then system (27) takes the form: 
E =0, 
Wy = ME + JoWo ts, 
W,=J,W4-- 


Recall that all the eigenvalues of J, are equal to zero. The matrix of the linear 
part of this system has properties a) and b) (of section 1.2), and hence theorem 1 
applies to this system in E and W. By that theorem, there exists an invertible 
formal change of coordinates 


EW-4,Y, (29) 


under which system (24), (21) takes on the normal form, 


5=6 YY dip(4)¥2, jal,....m; (30) 
(@,A)=0 

w=M LY giglAY®, i=l... (31) 
<Q,4}=0 


Such a normalizing transformation is not unique. In particular, theorem 3 tells 
us that there always exists a distinguished normalizing transformation, for which 
6; = €}, j = 1,...,m. Under this transformation, system (30) is identical to system 
(24), and system (31) reduces to system (23). The successive application of trans- 
formations (26), (28), and (29) does not change the parameters E£ and carries 
system (21) into the form of (23). J 


Example 2. Let the right-hand sides of system (21) be linear in X: 
@(E, X) = A(E)X + A(E) . 


It is easy to show that the normalizing change of coordinates (22) will likewise 


be linear in Y: 
X = BE)Y + BiE) 


as will be the normal form (23): 


¥=C(E)Y+ CE), 
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where C(E) = B~'(E)A(E)B(E). By theorem 4, the element ¢,,(Z) will be nonzero 
only if A; = 4,; that is, the matrix A(£) is similar to the block-diagonal matrix 
C(£) in which every eigenvalue A, of the matrix A(0) corresponds to one block. 
At the same time, further simplifications inside each block can be obtained by 
similarity transformations (see Arnol’d, 1971a). But these further simplifications 
are not covered by the theorem on the reduction to normal form. Instead, they 
correspond to a secondary normalization (see § 2, chapter ITI). 

Theorem 4 was formulated and proved in an article by Bruno [1974b], where 
one can find further examples of the application of this theorem. In general, if 
we treat small parameters as local coordinates, then the various specialized 
perturbation methods are just variations on the reduction of system (21) to 
normal form (23) (see §§ 3 and 4 of this chapter, as well as Bruno, 1973d, 1979). 


1.7. On the Calculation of the Normal Form 


Many properties of a given system are more easily investigated with the help 
of its normal form. For such purposes, it is usually sufficient to establish general 
properties of the normal form and to calculate a few of its lower-order terms. To 
perform these calculations, it is first necessary to make a linear coordinate 
transformation under which the matrix of the linear part is put into triangular 
form with the property b) (for example, into Jordan form; see Gantmacher, 1959). 
Then we must perform a non-linear change of coordinates in accordance with 
theorem 1. The calculations of the coefficients of the normal form can be carried 
out in the same order as in the proof of theorem 1. 

Generally speaking, these calculations are quite cumbersome. The coefficients 
Jig and hg are polynomials of the coefficients f;p and 6p, where 


(0;  #<P,AY4 0, 
5p = : i“ 
li, if <P,Ad=0. 


It would be convenient to write these polynomials out explicitly for small values 
of n and ||Q||. Starzhinsky [1972, 1973, 1974, 1977] did so in part; however, his 
notation was rather inconvenient, for, it does not facilitate calculations, but 
complicates them and sometimes is the cause of errors. In the same work he 
solved the question of stability for certain mechanical problems with the help of 
the normal forms. But if system (1) has already been normalized in its lowest- 
order terms, (i.e. all the lower-order terms in the right-hand sides are resonant), 
then the calculation of further terms of the normal form can be simplified. 


Theorem 5. For system (8), let there exist some vector K, such that for all terms 
figX? in (8), including the linear terms, we have: 

1)<0,K> 30, 

2) fig = 9, if <Q, K> < s and <Q, A> # 0—that is, system (8) has been normal- 
ized up to terms of degree s (with respect to vector K). 
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Then in the normal form (9), we will have 
Ge=fo, if<Q,K><2s. (32) 


Proof. The proof proceeds exactly as did the proof of theorem 1. We choose 
as our normalizing transformation the distinguished transformation for which 
hig = 0 when <Q, A> = 0. By induction on the ordering of the indices Q, we will 
show that 

hyp = O and g;p = fip , if<P,K><s. (33) 


Let this be true for all vector indices P which precede a vector Q: <Q, K> < s. In 
equation (11), the right-hand side is a sum of terms of the form 


Jinhip and fish; p,--.Rjw, » (34) 


where all the vectors P, R, S, P,, ..., P, precede Q. In agreement with our 
condition, <S, K> > 0 and, by our inductive assumption, <R, K> > 0. Since 


«0, K> = (R,K> + <P, K> = (S,K) + ¢P,,K> +°°°4+ <P, KD <s, 


then <P, K> < sand <P,, K> < sforat least one !in/ = 1,...,k. By the inductive 
assumption, the corresponding coefficient h;p or h; p, must vanish. Consequently, 


Gig + ¢Q, A>hig = Sio : 


If <Q, A> #0, then fig = 0 by the conditions of the theorem, and hig = 0. If 
<Q, A> = 0, then gig = fig. The induction begins with the linear terms, for which 
the assertion is trivally satisfied. Thus, property (33) is proved. 

We now use induction with respect to the order of the vector indices to prove 
property (32). Let this property be satisfied for all those vector indices which 
precede the vector Q: <Q, K> < 2s, <Q, A> = 0. We consider the terms (34) from 
the right-hand side of equation (11). Since <R, A> = 0, then <P, A> = 0. But then 
hyp = 0, since we use the distinguished normalizing transformation. We have 
shown for these terms (34) that 


S,K>20, <P,K>25, 
but 
2s > <Q, K> = <S, K> + (P,,K> a ea <P, K> > sk 3 


consequently, k = 1, and we are left with terms of the form fish;,p, and fig, 
where S + P, = Q. By property (33), <P,,K> 2s, so that <S,K> = <Q, K> — 
<P,,K> < 2s —s =s. By the condition of the theorem, f,, is then non-zero only 
if <S, A> = 0. But then ¢P,, A> = 0, and hj p, = 0 since the transformation is 
distinguished. We thus obtain equation (32). The induction begins with the linear 
terms, for which the assertion of the theorem is trivially true, and proceeds from 
there. 1 
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Remark. In the situation of theorem 5, if the matrix A is diagonal and the f; 
contain no terms of degree less than s, then 


l 
Jig = \ <P, Ay ins® Fp> + fig » (35) 
P+R=Q 
where s < <P, K), (R,K)> < 2s < (0, K) < 3s; (Q,A> =0, <P, A> #0, and 
<R, A> #0. 


Proof. It is evident from equation (11) that 


Jig = Loite ’ 


where {¢;}9 is the coefficient of y; Y2 in the series p{ Y + =(Y)). Let us calculate 
the contribution to this coefficient from the term x; fg X*. We have 


XifiznX*® = y,Y"fin(l + h(L + h,)"...(L + h,)" 
= yiY*fig(il +h, + <R,H>+-°:), 


where the dots indicate terms of degree greater than 2s, since by (33) the expan- 
sions of h, begin with terms of degree s. Since <R, K> > s, we have the following 
expression for terms of degree less than 3s: 


{9i}o = >» Sirthip + <R, Hp>) + fig » 
P+R=Q 


and for terms of degree less than 2s, we have {g;}p = fip. Thus, given the above 
bounds on the vector exponents, we can obtain from formula (11) the equation 


go= > Sirlhiep + <R,Hp>) + fie hipS P, AD = fip - 
P+R=Q 
Since P + R = Q and <Q, AD = 0, then <P, A> = —¢R, A>. Hence 


Sizhiv = Sip fir§P, A> t= —firhir > 


from which (35) follows immediately. 1] 


Example 3. If K = (1,..., 1), then <Q, K> = ||Q||, which is just the usual degree 
of the term fg X®. Theorem 5 asserts that in a system normalized to degree s, the 
calculation of the coefficients of the normal form up to degree 2s reduces to 
isolating the resonant terms (see lemma 3§4 in Bruno, 1971, 1972a). If the 
matrix A is diagonal, then s = 1, and theorem 5 and the remark following tell us 
that g:9 = fig for ||Q|| = 1, <Q, A> = 0; moreover, formula (32) is true for ||Q|| = 
1, while formula (35) applies for ||Q|| = 2 and ||P|| = ||R|| = 1. In order to employ 
this formula, we need (for each Q: Q EN, ||Q|| = 2, <Q, A> = 0) to find all possible 
choices of P and R such that P+ R=Q, P, REN, and |/P|| = ||R|| = 1. In 
particular, for oscillating systems with neutral linear part, this formula helps us 
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calculate the cubic terms of the normal form, which determine the stability and 
other asymptotic properties of solutions of a given system. 

For example, let n = 2 and 4, = —2,, the only vector Q e N with ||Q|| = 2 
and <Q, A> = Ois Q = (1, 1). Possible values of R, P € N are 


R | (2,—1) | (1,0) (— 1,2) 
P | (-1,2)| OD 


(2, —1) 
Formula (35) takes the form 


(0, !) 
(1,0) 


1 1 
= ——f, — t 
Giu1.1) = 3 Fi2,-fc-1,.2) ea Fico fio.) + 7, Fo. nF201.0) 
3A, 1 hy 


2 
ru 37, on nfra.-0 + fia - 


Hence, 


2 1 1 
914,11) = Sia. + Fh Fic-1.2f22.-1) ~ 7, Sua.ofro.1) a 7 fio. yS2u.0) : 
Ay 1 Ay 


If A, is pure imaginary, then Re g,(1,,) is the Lyapunov number; the stationary 
point is stable for Reg,,,,,) < O and unstable for Re g,,,,;, > 0 (compare with § 1 
of chapter II). In agreement with section 1.9 of chapter II, for a given real system 
in complex notation, the reality condition f,(p,4) = fr, p) is satisfied. Therefore, 


Re giajy = Re fics) — ReVl/Ai faofio,v} 


= Re fia. + im{ fig ofio,}/Im/, . 


Let D be some set of vectors in n-dimensional real space R}. Let D, denote 
the set consisting of finite sums of vectors of the set D. If C is another set of points, 
then we let C + D denote the set of sums of a vector from C and a vector from D. 

We rewrite system (8) (including the linear terms) in the form 


X= Xf =x) fioX® . i=l,....n; 


we let Fy = (fig,-.-,frg) and F = Y Fy X°®. Let us denote by D(F) the support of 
F—that is, the set of points Q for which Fo # 0. 


Theorem 5’. In the situation of theorem 1 with a distinguished normalizing 
transformation (9), the coefficients hig and gig depend on a coefficient fp only when 
QeP+D,(F). 


The proof is not given here; it is similar to the proof of lemma 2 in §7 of 
Bruno [1971, 1972a]. It is not difficult to derive theorem 5 as a consequence of 
theorem 5’. In theorem 5’, the normalizing transformation need not be distin- 
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guished. Let H, = (h,4,...,h,4) be the resonant parts of the series h;. Then the 
coefficients h,g and gjg depend on the coefficient /,, only when 


QcP+D,(F)+D,(H,) . 


1.8. Invariance 


In the rest of this part of the chapter, we present the results found in Bruno 
(1975b]. It will be convenient to slightly alter our notation. We will write a given 


system in the form ; 
X = AX + O(X) . (36) 


Let the matrix J be the Jordan normal form of A; then J = B™' AB. If we apply 
the linear coordinate change 
X = BZ (37) 
system (36) becomes 
Z=JZ+@(Z), (38) 


where the Taylor series of the function @(Z) has no constant or linear terms. 
Now we consider a formal, non-linear change of coordinates of the form 


Z=W+5(W), (39) 


where 5(W) = (€,,...,¢,) is a power series in W with neither constant nor linear 
terms. Let transformation (39) carry system (38) into the formal system 


W=JIW+ ¥(W), (40) 


where = (,...,W,) is a power series with no constant or linear terms. We 
denote by A = (A,,...,4,) the vector of the diagonal elements of J. By the A- 
resonant part of the series 5, we mean the series 5, = (€,4,...,&,4): 


bia = Wj DL higW® , 
where the summation is taken over all QO for which 
(Q, A> = aA to + dndn =O. (41) 


System (40) will be a normal form if the series ¥ coincides with its own A-resonant 
part ¥,. According to theorem 1, every system (38) can be taken into normal 
form (40) by means of a formal change of coordinates (39). The coefficients of the 
series 5, may be chosen arbitrarily, but these determine the full series ¥ and = 
uniquely. Note that the series ®, 5, and ¥ are related by the partial differential 
equations 


& 
ty 


Y(W) + ay + Y(W)) = JE(W) + O(W + E(W)) , (42) 


5 
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and theorem 1 becomes a statement about the properties of formal solutions 2 
and ¥ of this system of equations. 
Let a system similar to (38) 


Z=IZ+ &2) (43) 
be transformed by a coordinate change 
Z=W+ EW) (44) 
into the system : 
W=JIW+ HW). (45) 


Let A be the vector of the diagonal elements of the Jordan matrix J. If A # A, 
then the A-resonant part 2, of some series Z and its 4-resonant part 27 may 
contain terms with different exponents. Let us denote by =(W) the series with 
coefficients that are the complex conjugates of the coefficients of the series 2(W). 


Theorem 6. Let system (38) be reduced to normal form (40) by transformation 
(39). In the following three cases we can obtain for system (43) the normalizing 
transformation (44) and the normal form (45) according to the following formulae: 

1) If J=J, and &(Z) = GZ), then E(W) = =(W), BW) = YW), and 
Ex(W) = 5,(W); 

2) If J = 43, B(Z) = 5G(Z) for some number 640 then E(W) = =(W), 
Y(W) = 6¥(W), and E;(W) = =,(W); 

3) If, for some non-singular matrix K, J = K"\JK and @(Z) = K"!®(KZ), 
then E(W) = K*3(KW), ¥(W) = K7!¥(KW), and 27(W) = K'2,(KW). 


Proof. The proof in each case follows this scheme. First, we prove that 
transformation (44) takes system (43) into system (45); that is, we show that the 
system of formal equations 


WwW) toy a aw + &(W)) = JE(W) + B(W + S(W)) (46) 


is satisfied. Then we show that #3(W) = ¥(W) (i.e., system (45) is a normal form) 
and that the series £7(W) is related to 2,(W) in the manner suggested by the 
theorem. We proceed with the separate cases: 

1) If we replace all the coefficients in system (42) with their complex con- 
jugates, then the equality still holds. Consequently, system (46) is satisfied for 
E(W) = =(W) and ¥(W) = ¥(W). In this case, A = A and the set of real solu- 
tions Q of equation (41) is the same as for the equation 


<Q,A> =0. (47) 


Consequently, for every vector power series =(W), the resonant parts 8 E4 and 27 


are identical. Hence the series ¥ consists solely of resonant terms and =z = =, 
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2) Multiplying the equations (42) by 5, we find that the series (W) = =(W) 
and ¥(W) = 6¥(W) are solutions of system (46). Since A = 654A, the sets of real 
solutions of equations (41) and (47) are identical. Consequently, for every vector 
power series =(W) we have 2, = £7. Therefore the series 6¥ consists solely of 
resonant terms, and £3 = &y. 

3) If we make the coordinate change W = KW in system (42) and then 
multiply the system on the left by the matrix K~!, the equality is preserved. 
System (46) therefore has solutions 5(W) = K71=(KW), ®(W) = KY(KW). 
Let =(W) be some vector power series and let =(W) = K~1=(KW). We will now 
show that 

E(W) = K'E,(KW) . (48) 


Let Land £ be the diagonal matrices formed from the diagonal elements of the 
Jordan matrices J and J, respectively; that is, L = diag A and L = diag 7. By the 
conditions of the theorem, JK = KJ(K = (k,;)). According to Gantmacher [1959, 
ch. VIE, § 1, thm. 1], k,; = 0 if A; 4 4;. Hence, 


LK=KL. (49) 


Let Z(W) and H(W) be vector fields. Their commutator (or Poisson bracket) 
is the vector field 


az, all, 
dw, dw, 7 : 


@(W) =[Z,H] =) ( 

j=l 

The commutator is linearly dependent on each of the fields Z and H, and is 
invariant under a change of coordinates. For example, if W = KW, then 


K~'@(KW) = [K7'!Z(KW), KH(KW)] , (50) 


where the commutator is calculated in the W coordinates. Note that for ¢; = 
wf;W2, i= 1, ..., n, where (f;,...,f,) is a fixed vector, we have [Z, LW] = 
Z<Q, A>. Therefore the resonant part Z, of a series Z is distinguished by the 
property [Z,, LW] = 0. By (49) and (50) 


[Z, LW] = [K"'Z(KW),K'LKW] =[K™'Z(KW),LW] . 


That is, the 7-resonant terms of the vector series K~!Z(KW) correspond to 
the A-resonant terms of the series Z, but non-d-resonant terms of the series 
K~1Z(KW) correspond to non-A-resonant terms of the series Z, and conversely. 
Consequently, the resonant part =7 corresponds to the resonant part 24, and 
statement (48) is satisfied. 

The following equations are implied by (48), 


P5(W) = K'°,(KW) = K 'WKW) = WW) , 
E,(W) = K12,(KW) , 
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that is, system (45) is a normal form, and the resonant part of the normalizing 
transformation (39) determines the resonant part of the normalizing transforma- 
tion (44). The theorem is proved. 0 


1.9. Real Systems 


Suppose that system (36) is real; that is, 
A=A, O(X)=O(X). (51) 
Then for the Jordan form J we can choose a block-diagonal matrix 
J ={Jy,Jy,J5} (52) 


where J, = J, is a Jordan matrix of order / and J; = J; is a Jordan matrix of 
order m. Clearly, 21+ m=n. The eigenvalues 4,, ..., 4, form the diagonal 
elements of J, with 4, = 4,,; (i = 1,...,1) as the complex eigenvalues and /5,,,, 

., A, as the real eigenvalues. The sub-diagonal elements are either zeros or some 
real number o, which can be chosen arbitrarily. We denote the k x k identity 
matrix by E™. 


Lemma 1. If the matrix A is real, then there exists a transformation (37) to 
system (38) with matrix (52) such that 


B=BI, (53) 
where I is the block matrix 
0 EE 09 \ 
I=([E® 0 0 (54) 


\O 0 EM} 


The proof follows immediately from theorem 28 in § 34 of Pontryagin’s text 
[1961], according to which there exists a basis, in which the matrix A has a 
Jordan form, which differs from the complex conjugate basis only by a permuta- 
tion. The matrix I gives this permutation for the Jordan form (52) (see also 
Hartman, 1964, Ch. 4, §9). 0 


Note that 
P=EO, Fal, THI, Jal. (55) 


We now introduce the matrix 
Eo iE oO \ 

C=|E® —iE® 0 
\ 0 0 E | 


(56) 
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for which by (54) * 
C=I1C. (57) 


Therefore, and by virtue of (53) and (55), the matrix D = BC must be real, since 
D = BC = BIIC = BC = D. Consequently, the transformation X = DY is real. 
Here, the real coordinates Y are related to the complex coordinates Z by the 
standard transformation 


Z=CY. (58) 


This means that if the X coordinates are real, then so are the Y coordinates, while 
the Z coordinates satisfy the reality eae Z = IZ (see Hartman, 1964, Ch. 6, 
§9). Note that in system (38), ®(Z) = B-10(BZ). as and according to (51) 
and (53) we must have 17! @(1Z) = 1-1B-'!O(BIZ) = B™'O(BZ) = O(Z), ie., 


I'@(IZ) = O(Z) . (59) 


Now let system (38) be transformed into system (40) by the transformation 
(39). We perform the standard transformation W = CV with matrix (56). We will 
show that under the condition 


IEW) = 5(W) (60) 


the transformation from Y to V is real. In fact, this transformation is Y = V + 
C7'E(CV). By (57), (55), and (60), 


CE(CV) = C5(CV) = CIEUICV) = CE(CV) . 


Theorem 7. Let system (38) be transformed into the normal form (40) by the 
coordinate change (39), and let property (59) be satisfied. Then property (60) will 
also be satisfied, if the resonant part = ,(W) of the series 5(W) is such that 


TNE AUIW) = E4(W) . (61) 
In particular, the property (60) is satisfied for 5, = 0. 


Proof. The proof consists of the repeated use of theorem 6. Since I"'JI = J is 
a Jordan matrix, then the third statement of theorem 6 with K = I applies to 
system (38). Applying as well the first statement of that theorem and the property 
I = I, we find that system (43), where J = I7'JI and @(Z) = I7'B(1Z), is trans- 
formed into the normal form (45) by transformation (44), where E(W) = 
IEW). In addition, 5,(W) = 17'2,(IW) and P(W) = I-' ¥(IW). But by 
assumption, properties (59) and (61) are satisfied. Hence J = J, 6 = @, and 
2, = £4; that is, systems (38) and (43) coincide, as do the resonant parts of 
transformations (39) and (44). By theorem 1, such normalizing transformations 
coincide, and 
YW) =1 PUW) , (62) 


that is, (60) is satisfied. The theorem is proved. J 
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This proof is a generalization of a proof given by Siegel [1956, §14] for a 
more specific class of systems. 


Example 4. Let n = 2. If the eigenvalues J, and A, are real, then J = E®, and 
system (38) and its normal form (40) are both real systems. If the eigenvalues are 
pure imaginary, then A, = 1, = —A,, and I =({ 9). The reality condition (59) 
for system (38) is @,(2,,22) = @2(Z2,2Z,). By theorem 7, the normal form (40) can 
be chosen such that an analogous relationship (62) for ¥ is satisfied. 


1,10. Linear Automorphisms 


Suppose that, under the linear transformation 
X=MX, t=6, * (63) 


where M is a non-singular matrix and the number 6 is non-zero, system (36) is 
transformed into 
dX /dt = AX + 6(X) . (64) 


If, under the change of variables, X + X,f— t given by (63), system (64) becomes 
system (36) (that is A = A and 6(X) = @(X)), then transformation (63) is called 
a linear automorphism of system (36). If system (36) has an automorphism (63), then 
system (38), obtained from (36) with transformation (37), has an automorphism 


Zane. Gemor, (65) 

where N = B-' MB; that is, 
6NJIN=J, (66) 
5N~!@(NZ) = G(Z) . (67) 


Theorem 8. Let the coordinate change (39) transform system (38) into the normal 
form (40), and let system (38) have an automorphism (65). Then the normal form 
(40) has the automorphism W = NW, t = 6t, if, in the normalizing transformation 
(39), 

N12, (NW) = £4(W) . (68) 


In particular, this is true for 2, = 0. 


Proof. We apply the second assertion of theorem 6 to system (38); to the 
resulting system, we apply the third assertion of that theorem with K = N 
(applicable as a consequence of (66)). We thus find that the system 


Z =5N7JNZ + 6N“'@(NZ) (69) 


under the transformation 


Z=W+NS(NW) (70) 
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becomes the normal form 


W=65N"JINW + 56N7YNW) . (71) 
Furthermore, 
{N7UE(NW)}, = NUE,(NW) . (72) 


By assumptions (66) and (67), the original systems (69) and (38) coincide; by 
assumption (68), and by virtue of (72), the resonant parts of the normalizing 
transformations are the same. By theorem 1, the normalizing transformations 
(70) and (39) also coincide, as do the normal forms (71) and (40). The theorem 
is proved. F] 


Example 5. Let system (38) be invariant under the exchange of —z, for z,; 
that is, the series y, contains only odd powers of z,, while the remaining ¢, contain 
only even powers of z,. Then the normal form (40) can be chosen such that it is 
invariant with respect to the exchange of w, and —w,. 


§2. The Integration and Classification of Normal Forms 


2.1. Power Transformations 


If all the right-hand sides in the normal form are written as y;g;: 
vi =ViGi =Vid IgV? > (1) 


then Gp = A and the coefficient g;9 differs from zero only when <Q, A> = 0. This 
follows, for non-linear terms, from the definition of the normal form. We will 
show here that it holds for linear terms as well. In fact, the term a,;y, corresponds, 
in the notation of (1), to the term ViGin,-E, ViVi For this term, 


Q=E,;—-E,, 0, A> = 4,-4.5 


but by property b) of section 1.2, 4; — A, = 0 if.a, 4 0. 

Notation (1) and the definition of the normal form allow a simple geometrical 
interpretation. 

We place every non-zero coefficient g;9 in correspondence with a point Q in 
the integer lattice of n-dimensional real-affine space R", where Q = (q;,.--54n)- 
We will denote the set of all such points by D(g,,...,g,) or simply D(G). The 
normal form differs from an arbitrary system (1) by the fact that its set D(G) lies 
entirely in the linear subspace which is orthogonal to the vectors Re A and Im A. 
This allows us, with the help of a power transformation 


uz = Yi. Vai™ 5 i=1,...,n (2) 


to reduce the order of the system, which is a normal form, sometimes even to 
integrate it. Transformations of systems of differential equations by changes of 
coordinates (2) were treated in detail in §2 of an earlier article by the author 
[Bruno, 1965]. 


Lemma 1. Let D(G) be a set of points Q € R" such that in system (1). Gg = 
(9195---»9ng) # 9, and let transformation (2), where the «,; are real and, (if « = («,)) 
det a # 0, transform system (1) into the system 
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a; = ujgi(U) =u, > gig U2 , i=l,...,n. (3) 
ok 


Then 
D(G’) = «* 'D(G) . (4) 


Proof. Let us write 
In Y = (Iny,,...,lny,) , InU = (Inu,,...,Inu,) . 
In vector notation, system (1) takes the form 
(In'Y) = oie Ggexp<Q,In Y> , 
while system (3) is 


(InU)= YY Goexp<Q'inU> , 
Q'eD(G') 


and transformation (2) becomes 


InU=alnY. 


Under this transformation, the term Y2 becomes 


Y? =exp(Q,In Y> = exp(Q,a'InU)> 


= exp<a !*Q,In U) = Ut *2 
and . ' 
(InU)=a(InY)= ) aGg¥2= Yo aGgu* 'e 
@eD(G) oe D(G) 
Consequently, 
Y aGgexp<a*O,InU>= } Go exp<Q',InU) . 
Qe DG) o' ei’) 
Therefore, 
Q’ = a*Q ; Go: = “Go : (4’) 


Thus, the set D(G’) of points Q’ such that Gg. # 0 can be found from D(G) by 
linear transformation (4). This proves the lemma. 0 


Theorem 1. Let system (1) be a normal form, and let 5 be the number of linearly 
independent points Q € N which satisfy the equation <Q, A> = 0. There exists a 
power transformation (2) (a, integers, deta = +1), under which the normal form 
(1) becomes the system 


UW, = ujgi(uy,--..Ms3), t= 1,....n. (5) 


The first 6 equations of this system define a system of order 6, while the remaining 
equations reduce to quadratures. 
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Proof. The proof relies on lemma 1 and on certain properties of free Abelian 
groups with a finite number of generators (see Kurosh, 1953). Let K be a linear 
subspace in R” which spans the set of solutions Q € N of the equation ¢Q, A> = 0. 
Clearly, K is of dimension 6. The vectors of the integral lattice in R" form, under 
addition, the free Abelian group Z” with n generators. M = Z" 7 K isa subgroup 
of Z" of rank 6 which is, clearly, a free Abelian group together with the factor- 
group Z”"/M of rank n — 6. There exists an isomorphism of the group Z” and the 
direct sum M @ Z”/M which is the identity mapping on M. Let the vectors R,, 
..., Rs be generators of M and let Rj,,,..., R, be generators of Z"/M. Taken 
together, they give a full system of generators of M © Z"/M. Their images under 
the isomorphism described above, R,,...,R5,S511,---, S,. give a complete system 
of generators of Z”; that is, the matrix a*, in which the ith column is R,(i < 4) 
or S,(i > 6), is unimodular, and «*1R; = E;. At the same time, the vectors R; 
define a linear basis of K. The transformation Q’ = a*~'Q carries the subspace K 
into the coordinate subspace spanned by E,,..., Es; that is, g5,, =°-- = q, = 0. 

By lemma |, systems (1) and (5) and transformation (2) are related by formula 
(4). But D(G) c K, so that D(G’) < a*'K; that is g, ..., g, are independent of 
gat» +++» Un. The proof is complete. 

For another proof of theorem | which gives a way to calculate the matrix a 
see Bruno [1965]. 


2.2. Classification 


Until now, we have been indifferent to the ordering of the Jordan blocks in 
the matrix (6@;/0x;)9 or the order of the eigenvalues /; in the vector A. It turns 
out that the normal form sometimes has nearly triangular form under the right 
ordering of the Jordan blocks or, equivalently, the correct indexing of variables. 

In what follows, we will distinguish two cases for the eigenvalues 4,,..., An. 
Let these numbers be considered as points in the complex plane. 

Case 1: There exists a line, passing through the origin in the complex plane 
such that none of the points A; lie on one side of the line, while exactly / (counting 
multiplicity) of the points 4; lie on the line. The variables are indexed so that 
these | values are the first ] components of A. 

Case 2: Points 4; lie on both sides of any line through the origin. 

Let Jt be the convex hull of the points 1,, ..., 4,; Jt may be a point, a line 
segment, or a polygon. Clearly, in case 2 M is a polygon which encloses the 
origin. 

We identify four subcases of case 1: 

a) The origin lies outside of It; then there exists a line Mt through the origin 
such that all of the points 4; lie to one side of the line Mt, and no points A; lie on 


Mi = 0); 
b) the origin is a vertex of the polygon ® or an endpoint of the line segment 
N. Then A, =--- = 24,=0(0 <i <n), the remaining /; are non-zero, and there 


exists a line It through the origin such that 4,,,,..., 4, lie on one side of Mt; 


192 Chapter III. The Normal Form of a System of n Differential Equations 


c) Nis a polygon, and the origin lies on one of its edges but not at a vertex; 
let A,,..., 4, be all the points /; that lie along this edge (1 < / < n); then the line 
M, along this edge, is such that J,, ..., 4, lie on Mt, while A,,,,..., 4, lie on one 
side of M; 

d) MN is a line segment which contains the origin in its interior; then the line 
IM which lies along this line segment is such that all of the J; lie on M (i.e., | = n). 

In the following discussion, we will assume that in case 1 we will always 
choose the line IN described above, that / is the number of points A; on Mt, and 
that the variables Y have been so indexed that 


OS eH Hh <i SSH (6) 


where yj; is the distance of the point ; from the line M. Condition (6) determines 
the relative placement in the matrix J of Jordan blocks with different y;, leaving 
arbitrary the relative placement of those blocks with equal y;. We note that the 
line M is uniquely defined in subcases |c) and 1d). In subcases 1a) and 1b), we 
can choose for 9t any line which lies inside of some pair of vertical angles with 
vertex at the origin. In particular, we can choose It such that yw; = p; only if 
A; = A, 

Let us denote by t a complex number of unit modulus such that the two- 
vector (Re t, Im 7) is orthogonal to M and tf lies on the same side of Dt as N. We 
let 

H; = Re(tA,) , vy, = Im(tA,) , j=i,...,n 
and 


M = (fy,--+s a) ? N =(v1,---5¥_) * 
Then 
TA =M+iN (herei* = —1). 


It is easy to see that py, is, as before, the distance of the point /; from the line M. 
It is convenient to imagine that IM is the imaginary axis and that MN lies to its 
right. Then t = 1, M = Re A, N = Im. We note that the equation 


<Q, A> = 0 (7) 
is, for real Q, equivalent to the system of two equations 
(Q,M>=0, (8) 
<Q,N> =0. 


Let V =(v,,...,v0,) be an n-vector; let us write V’ =(v,,...,0,) and V" = 
(Up415-.->0,). By V > 0, we mean v, > 0,..., v, > 0; we will use this notation 
widely. For example, condition (6) implies that Mf’ = 0, M” > 0. 


Lemma 2. If A belongs to case 1, then equation (8) has only those solutions 
QeéN, for which Q” = Oif i <1; ifi > I, then either 
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Q"=E,—E,,  wherep,= 4; , (?) 
or 


Q”= ¥ gE,-E, , where q; 20, m<i, Um <i; , (10) 
jet 


Proof. In consequence of (6), equation (8) is equivalent to the equation 
<Q", M">=0. (11) 


Suppose Q € N,,i < [; then Q” > O and <Q, M> > Oif Q” # 0. Consequently, for 
such solutions of equation (8), 0” = 0. 
Now suppose that QeN,, i> J; then equation (11) has, by virtue of (6), 
solutions of only two types: 
1. "|| = 0, 0" = E, — Ey, if yj =u. 
HO" > 0,4) = — 1, din = 0 = ay = 0. 
These are solutions (9) and (10). 


Corollary. If A belongs to case 1, then equation (7) has only the kinds of 
solutions described in Lemma 2. 


Theorem 2. [f A belongs to case 1, then the normal form has the form (12): 


y=aM, f=aon,t, (12') 
we J, by i + Digs ae VE hare > i=1+ 1,...,n * (12”) 
iz 
Here i, by, and big,,,...q,, are power series in y,,..., yy. The first sum in (12") is 


taken over those j > | for which equation (9) is satisfied; the second sum is taken 
over all integers qy4,,..., 4;-; for which (10) is satisfied. 


Proof. The proof follows immediately from lemma 2. In the normal form (1), 
Jig # 9 only if Qe N; and <Q, A> = 0. If i <], then, by lemma 2, q,4, =--: = 
qn = 0; that is, y,41,--., ¥, do not appear in w; in non-zero powers. Consequently, 
W; is independent of yi4,,---, Va- 

If i> 1, then Q” has the form of either (9) or (10). The corresponding terms 
are exactly those that appear in (12”). 0 


Remark, If 4 belongs to case 1, then subsystem (12’) of the normal form (12) 
is itself a normal form of order /. Analogously to theorem 2, it can be deduced 
from lemma 2 and theorem 2 of § 1 that, in case 1, a transformation from normal 
form (12) to another normal form takes the form (13): 


Yip = NlZ1.---52) » a Pee (13’) 


W= YL eZ )z; + ni(21,---5 21-1) 5 i=l+l,...,n. (13”) 


jeri 
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Moreover, (13’) takes the /-th order normal form (12’) into another normal form 
of order J, while the n; in (13”) contain no terms of less than second degree in Z”. 
We will show here what theorem 2 means in the subcases of case 1. 
la) Here | = 0. Hence, there is no subsystem (12’) in (12), and the series b,, 
and 5j,,,,...9,-, are constants. Moreover, b, = 0 for j > i, since we have chosen 
the Jordan matrix of the linear part of (12) to be lower triangular as in theorem 


1 of §1 of this chapter. Thus, the normal! form is triangular: 
Ji =A + Y Dig, cg, YE VEY» P= 12s (14) 
where the sum is taken over all integers q,,..., qj-) > 0 for which 
Ay = 414, +07 + Qi-14i-1 » et Barre (15) 


Normal form (14) was found by Dulac [1912]. Equations (15) have only a finite 
number of solutions for integral, non-negative q,,...,4q;-,- Hence, the right-hand 
sides in (14) are polynomials. For the same reason, transformations from one 
normal form to another are also polynomial transformations and hence they are 
biregular. The equations of system (14) can, clearly, be successively solved by 
quadratures. 

1b) Here A, =--- = 4, = 0, so the first sum in (12”) is taken over those j for 
which 4; = A,, while the second is taken over integral, non-negative q,41,---)9i-1 
such that q,4, + °° + q;-; > Land A; = qis:Apa, H00° + int Ai-1- 

The integration of normal form (12) reduces in this case to two steps: first, 
solution of the /th order system and second, the successive integration of systems 
of linear equations with variable coefficients. If the 2,,,,..., 4, are all different, 
then the second step is just the process of successive quadratures of case La). 

1c) Here, subsystem (12’) is itself a normal form of /-th order for which 
theorem 1 is applicable. Therefore, integration of system (12) reduces, in this 
subcase, to the solution of a nonlinear system of order less than /, to an integration 
up to the order J, and to the successive solution of systems of linear differential 
equations with variable coefficients. If the distances of the points 4,4,,...,2, from 
the line M are all different, then (8) has no solutions Q” ¥ 0, and it is not necessary 
to solve systems of linear differential equations, but only to integrate linear 
equations successively. 

1d) In this case, theorem 2 yields no simplification at all. 

Suppose that L is the linear subspace in R{ defined by the real solutions of 
equation (7), and that 0 = {Q > 0} is the non-negative orthant (quadrant, octant, 
etc.). The above classification of A can be geometrically interpreted (in R{) in the 
following way: 

la) LA 0 = 0. That is, if 90 > O and Q # 0, then <Q, A> # 0. In fact, for such 
vectors Q, the complex number <Q, 4)/||Q|| lies either inside or on the boundary 
of the convex hull 9% of points 4;, and hence must be non-zero. 

1b) Ln 0 is an /-dimensional face of the cone 0, defined by the equation 


Q”=0. (16) 
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1c) LL O0is an(! — 1)-dimensional subset of an /-dimensional face of 0 defined 
by equation (16). 

1d) Here L has dimension n — 1 and intersects 0 at interior points. If 1a), 1b), 
and Ic) are considered as degenerate cases, then this is the regular case, since the 
intersection does not degenerate to part of the boundary of 0. Generally speaking, 
here it is impossible to expect any simplification of the normal form other than 
the reduction of the order of the normal form by at least one (theorem 1). 

2. This case differs from 1d) only in that the dimension of L is n — 2. 

For more detailed properties of the generalized triangularity of normal forms, 
see section III, §2 in Bruno [1971, 1972a]. 


2.3. One-dimensional Normal Forms 


This section follows from the author’s article [1973c] which discusses other 
related questions as well. Let us consider the formal system 


90 
ay =uyg (uy) , 91 = Git, Jim #0, m>O, 
=m 
(17) 
tz = U2g2(u,) , 92> a Gatlt . 


If g.9 #0, then this system is a normal form, since 4, = 0, 2, = goo and the 
vector exponents have the form Q = (k,0). According to theorem 2 of § 1, trans- 
formations which take normal form (17) into other normal forms have the form 


2) 

u, = 01d, (v,) = 2, Dm dvi dig #0, 
(18) 

uz = v2d,(v,) = v2 », dy,0; , do #0. 
Theorem 3. There exists a formal transformation (18) with do = dy) = 1, 

which takes system (17) into the form 
By = 0, (%m0T + YamP7™) » 

m (19) 
By = v2 B(v,) = v2 2. Burt 


where all coefficients are uniquely determined by system (17). 


Proof. We begin by deriving the first equation. Let transformation (18.1) take 
equation (17.1) into the form 


B=, 702,), y= mvt. 
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Then the series g,, d,, and y are related by the equation 
dd, m+1gmt1 
da, am. 01Y = 2,4,9,(014d,) =Ginet ap +o, (20) 
1 


and from (20) we obtain an equation for the coefficients of v™*!**: 


(1 + Kd sa ¥m + Vkt+m = Iim(m + Ndi, + Cm+itk > (21) 


where Cm4i4x iS a polynomial in d,; and y,,,; with j < k. For k = 0, since dy) = 1, 
formula (20) gives y,, = 913 equation (21) takes the form 


Gimlk — M)dim + Yatm = Cm+itk (k>0). 


This equation will be satisfied, if, successively, for k = 1, 2,..., we take 
Ne+m =O, dy. = Cmte im(K —m)'fork#Am, 
Yan Cimsi > d,, = an arbitrary number . 


Thus, equation (17.1) takes the form of (19.1) under transformation (18.1); then 
equation (17.2) takes the form 


ip =upBlv,), B= > Bot. (22) 


k=0 


The coefficients y,,, Yams Bi meee By do not depend upon the number d,,,, i.e., they 
are uniquely defined. It is easy to verify that the coordinate change 


Uz = v2 exp J (= fact) or mot + PombT") | dv, (23) 


has the form of (18.2) and takes equation (22) into form (19.2), where B, = fy. 
This proves the theorem. (] 


If we make a time transformation as described by Bruno [1971, 1972a, in 
example (3c) of the introduction], system (17) with go. # 0 becomes the system 


dv,/dt, = vt" , dv,/dt, = v2(Bo + Batt) 


under a formal change of variables. 

It follows from theorem 3 and the results of Horn [1899, 1900, 1913] (see also 
Bruno, 1971, 1972a, example (3c) in the introduction; Martinet et Ramis, 1982) 
that the analytic system 


X, = X19,(X1) , Xz = X2g2(x,) + a(x) , 920 #9 (24) 


can, under an analytic change of variables x, = y,d,(y,), X2 = y2d2(y1) + c(1), 
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take the form 
Yi =ViGm¥T + Yami") » 


Yo = 2 DS Beyt + ¥ yt. 
K=0 K=1 


That is, system (24) has, in the class of analytic transformations, m new invariants, 
the b,, in addition to the m + 3 formal invariants of y and £. The number of 
coefficients appearing in system (19) is m + 3; one of these can be set equal to 
(unity) by a linear transformation y, = ¢,z,. Consequently, normal form (17) has 
m + 2 continuous invariants with respect to formal coordinate changes. 

Note that the dimension of system (17) is 6 = 1. Now consider arbitrary 
normal forms (1) with 6 = 1. That is, all solutions Q € N of the equation (Q, A> = 
0 are proportional to one vector R. We will distinguish two cases: 

a) One of the coordinates of the vector R is equal to — 1. We can assume that 
r, = —1. The normal form is 


Y= AY; » i=1,....n-—1, 
Vn = AnYn 7 ay? es Vat ‘ 


The value of the coefficient a is affected only by linear transformations y; = ¢;J; 
such that @ = C¥a. Therefore, in the class of complex transformations a is either 
0 or 1; in the class of real transformations the sign of a can be preserved as well 
(if a A Q). 

b) All the coordinates of the vector R are non-negative (r; > 0). The examina- 
tion of this case is the basic aim of this section. We will assume that the integral 
vector R is the smallest possible, i.e., that the greatest common divisor of its 
components r; is 1. Then the normal form (1) is 


wo 
et ee ee (25) 


Let us write G, = (Gigs-++5Gnx)3 Clearly Go = A = (A,,...,4,), $0 that (R, Go> = 0. 
Consider first the case <R, G> # 0; since (R, G> = 2<R,G,>Y*®, then for some 
natural number m 


<R,G)> =Oforj<m,  <R,Gp> #0. (26) 


Note that, in agreement with theorem 2 of § 1, the transformation taking normal 
form (25) into another normal form has the form 


y=t VidyZ®, isdn. (27) 


Theorem 4. If property (26) is satisfied by the formal system (25), then there 
exists a formal, invertible transformation (27) with dig = 1 which takes system (25) 
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into the form 
2 se 2($ Gy Z* + ar,Z2"® | 3 i=1,...,n, (28) 
where all coefficients are uniquely determined by system (25). 


Proof. Let S, = R, and let S; = (5;;,..-, Sin), i= 2, ..., n, be integral vectors 
such that the determinant det(s,,) = 1. For the proof of their existence see section 
2.1 of this chapter. We make the power transformation 


Up = Ye I Hsin. (29) 
Then, in accordance with section 2.1, system (25) becomes the system 
u; = ujgi(u,) , i= l,...n, (30) 
where g; = <S,,G). In particular, by property (26), 
Gi =GimUl +7, Jim FO. 
Applying theorem 3 to system (30), we find that there exists a formal transforma- 
tion 
u,; = v,d,{v,) , d(0)=1 , i=1,...,n, (31) 
which takes system (30) into the form: 


By = 0, (YimYT + ©, 07") , 
{om P (32) 
i= rd radt + ow?) i=2,...,n, 
k=0 


where the coefficients «,; (i > 1) can be arbitrarily chosen, but w, and 4, are 
uniquely deseuained: In order to find the w;, we must replace py Bvt i in (23) with 


Y B,vk — w,v2". We apply 
vp, = mete ze, LSD, i.05% (33) 


so that system (32) becomes 
i= a( Y Gn Z*® + @,z2"*) : P= 1ywg ny (34) 
k=0 Z 


where G, = af, 2 = a, and a = (ay) = = (s,)"'. 

In order to obtain Gx Ra, it is sufficient to assume that Q = a7'Ra. Then 
system (34) will have the form of (28), where a = w,/(r? + ++: + r2), sincea IR = 
(s,)R = (<S,,R, (Sz, R>,.-., <S,,R>) and S, = R. It remains to be shown that 
the transformation from Y to Z has the required form. In fact, we can find from 
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(29), (31), and (33) that 
yy = ott. osindt(v,)...dt(v,) = 2d (Z%),..d%(Z") . 


Since the d; are series in non-negative powers of v, with constant terms equal to 
unity, then a product of arbitrary integral powers of the d; have the same form. 
That is, the transformation from Y to Z has the required form. 


Remark. Theorems 3 and 4 leave us free to apply a linear transformation 
y; = ¢,¥,;, which can reduce by one the number of continuous invariants. For 
example, we could require in system (28) that <R, G,,> = +1. Then the invariants 
of system (28) are the eigenvalues, the coefficients g,, (related by one condition 
for each i) and the number a. There are thusn + m(n — 1) + 1 =mn—mt+n+4+1 
invariants. In the real case, the sign of (R, G,,) can also be invariant. 

We shall now take up the case where <R, G> = 0 in the normal form (25), i.e., 
property (26) does not hold and 


CRG>=0, jf=0,1,2,.... 


In this case transformation (29) takes system (25) to system (30) with gj (u,) = 0, 
Le., 


u,=0, 
(34’) 
a; = uigi(u,) , gi(0) #0, i=2,3,....n. 
The change of variables (31) carries this system to the system, 
t,=0, 
(34”) 


B;, = vigi(v,4,(v1)) = vgj(u,) , i=2,...,n. 


Thus, only the one series d,(v,) of the n series d,(v,) of transformation (31) is 
effective; the others have no influence on the system (34”). But with the help of 
just one series only one series can be simplified. If g3, ..., gj, are constants (ie., 
independent of u,), then the series d,(v,) has no effect on the problem and 
gi(v,d,(v,)) = g;(v,) = g;(0) in (34”). In this case the normal form (25) has the 
form 

Vi = AY; » i=1l,...n, 

i.e., it is zero dimensional and unique. If among the series g3(u,), ..., g,(u,) at 
least one is non-constant, then there exists a vector T’ such that the series 


(TG) = elu) = Yay 


differs from a constant. Let 1, be the first non-vanishing coefficient in the 
sequence T,,T,.... Then t(u,) = t) + t,v! if we set 
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t 

tT — To Tr+1 

= =u,+ upto, 
Ty It, 


and we obtain u, = v,d,(v,) by inverting this expansion. Then in system (34”) 
we have that 
€T',G'(v,d\(v3))> = To + Td} - 


Ifn = 2 then it is sufficient to take T’ = (0, 1). In this case in system (34”) we have 
g2(v,4,(vy)) = 930) + tv}, 


and the normal form (34”) contains the single invariant t, which can be normalized 
by a linear transformation 6, = cv,. If n > 2, then in place of the vector T’ we 
can take the first non-zero vector G, in the expansion 


constructed for system (34’). Then for the given choice of the series v,d,(v,) 
system (34”) is a normal form in the sense of Belitskii [1979b]. However, for n > 2 
system (34’) has infinitely many formal invariants. By means of the inverse 
power transformation (33) we move from system (34’’) to the normal form 


2;= 2:9: » i=l,....n, 


which is associated with system (25) by an invertible formal transformation. Thus, 
everything claimed for systems (34’) and (34”) also holds for this system. 

Thus, even in the case of a simple resonance, if n > 2 and one of the normal 
forms (25) of the original system does not satisfy property (26), then the original 
system may have infinitely many invariants with respect to invertible formal 
changes of variables. 


2.4. The Secondary Normalization 


In the last section, we showed that if 6 = 1 the “normal” transformations of 
theorem 2, § 1, essentially simplify the normal form and are of use in finding all 
of its formal invariants. 

Such a situation is encountered in other cases as well. One example is a 
Hamiltonian system in the case of resonance (see theorem 2, § | in Bruno, 1970b). 
Another is a system in which the Jordan form of the matrix A of the linear part 
is non-diagonal. Let us discuss the latter in more detail. According to Belitskii 
[1975a, 1978, 1979a, 1979b], the normal form can be put in the form 


Y=JY+ WY), (35) 


where the formal series ¥ satisfies the equation 


oe ITY = J*¥(Y) ; (36) 


fa} 
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(J* is the transpose matrix of J.) If we introduce the diagonal matrix L = 
{A,,..-,4n}, then the definition of the usual normal form is written (see Sternberg, 
1958) 

ib 


ay LY = LYY) . (37) 


If equation (36) is satisfied, so is property (37) and, additionally, 


ow 
oY 


(J* — L)Y =(J* — L)WY) . 


This additional property, although it gives some simplification to the struc- 
ture of the non-linear terms, nevertheless, yields no further reduction of the order 
of the system (as did theorem 1, § 1, and theorem 1, § 2). 


Example 1 (see Belitskii, 1975b, 1978, 1979b; Bogdanov, 1976; Sadovskii, 


1976). Let n = 2, and 
Se 0 0) 
“Ade Oye” 


Since 4, = A, = 0, every two-dimensional formal system (35) is a normal form. 
But property (36) is satisfied only for systems of the form 


Ji = 4(y2) + yi Bly2) , 
(38) 


2 =V1 + Y2B(y2) » 


where (0) = «(0) = da(0)/dy, = 0. But the “dimension” of this system is still two. 
Series «(y.) and B(y,) can be uniquely determined under formal transformations 
with identical linear parts, but they can be changed by a transformation y,; = 
az, +°°', yy = bz, + az, +-°:. For generalizations of (38) see Belitskii [1979a, 
1979b], Cushman and Sanders [1985], Sadovskii [1982] and Palamodov [1983]. 

In accordance with theorem 1, under a power transformation the normal 
form becomes a system which includes a subsystem of order 6 with vanishing 
eigenvalues. As was shown in the preceding section, if 6 = 1 it is possible to 
calculate all of the formal invariants of this system. But when 6 > 1 it is, generally 
speaking, impossible to find all of the formal invariants. For example, when 6 = 2 
this is the problem of investigating a non-elementary singular point (see §3 of 
chapter II). If 6 > 1, a secondary normalization does not yield a substantial 
simplification of the normal form. The local method approach gives a better 
perspective here. It consists of dividing the neighborhood of a non-elementary 
singular point (i.e., when A = 0) into parts, and finding the “elementary” normal 
form for each of those parts. In this context, the class of useful transformations 
is wider than the class of the usual formal transformations in the form of power 
series (see chapter IV). 


§3. Analytic Integral Sets 


3.1. Statement of the Problem 


We will consider the system 
X; = O(X) , of (0) =0, b= 1g hs (1) 


for which the origin X = (x,,...,x,) = 0 is a stationary point. 
Problem 1: find all invariant analytic sets in the neighborhood of the point 
X = 0 which include that point. Here following Bruno [1975c], we shall briefly 
present, without proofs, some results fundamental to the solution of this problem. 
Instead of the problem | we shall solve Problem 2: find the sets in which 
system (1) can be normalized by means of an analytic change of coordinates. In 
particular, when is the transformation 


x, = E(Y) , i=1,...,7 (2) 
of system (1) to the normal form 


Ji = W(X) = yi me Jig? , i=1,...,n (3) 
analytic? pee 


The answer to the last question was found by Bruno in [Bruno, 1971, 1972a], 
while problem 2 was answered by Bruno in [Bruno, 1974c]. In sections 3.2~3.7 
we will, to simplify our presentation, assume that the eigenvalues of the system, 
Ay, -++5 Ans are pure imaginary, i.e., 

ReA=0. (4) 


3.2. Convergence of the Normalizing Transformation 


The normalizing transformation is not often analytic. We can therefore ask: 
for which normal forms (3) does the analyticity of the normalizing transformation 
(2) follow from the analyticity of the original system (1)? The definitive answer 
to this question appears in the works of Bruno [1967, 1971, 1972a], and is 
contained in the following conditions. 


3.3. On Sets 203 


Condition A. In normal form (3), W; = Aiyia, i= 1, ..., n, where a = a(Y) is 
some power Series. 


Condition . Let w, = min|<Q, A>| for QEN, ||Ol| < 24 and <Q, A> £0. 
Then the series 


oO 
¥, 27* ina, 
k=1 
converges. 
Condition w is a weak arithmetic condition on the eigenvalues 4,,..., 4,, 


which is almost always satisfied. In contrast, condition A places strict limits on 
the normal form, since it requires that the n series \; be defined in terms of a 
single series a. 


Theorem 1. If, for system (1), the vector A satisfies condition w and the normal 
form (3) satisfies condition A, then the normalizing transformation (2) is analytic. 


Moreover, it is shown in the author’s works cited above that condition A 
cannot be weakened. That is, if the normal form (3) does not satisfy condition A 
then there exists an analytic system (1) which is transformed into the normal 
form (3) by a divergent transformation. The normalizing transformation is thus 
not analytic, and the analytic normalization of (1) is not possible in any neighbor- 
hood of the point X = 0. 


3.3. On Sets 


Let the functions f,(X),..., {,(X) be analytic at the origin, and let them vanish 
there. Then the system of equations 


G§(X)=0, fHlgs (5) 


defines an analytic set which contains the point X = 0. In the ring of conver- 
gent power series, there is an ideal ¥ with basis f,,..., f, which corresponds to 
the set .“. If f,;,..., f, are formal power series, then we will say that the system 
of equations (5) defines a formal set .@, to which there is a corresponding ideal 
#, with basis f;, ..., f, in the ring of formal power series. The set .#@ will be 
analytic if the ideal Y has a basis of convergent power series. 

We will call the set .@ a (local) manifold if the system of equations (5) can be 
solved with respect to s of the coordinates x,,..., X,. 

Under a formal invertible coordinate change (2), a one-to-one correspon- 
dence is established between formal sets. We will therefore consider the image 
and pre-image as one set in different coordinate systems. In particular, a manifold 
remains a manifold in any system of coordinates. 
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The set .4@ will be integral (or invariant) for system (1) if 


of, 
\ gine dh Pei... 
id 


Ox 
1 


Problem 1’: Which formal invariant sets of system (1) are analytic? 

We ask this because it is relatively easy to calculate the formal invariant sets 
for the normal form (3). We need only learn how to determine which of those 
sets are analytic for the original system (1). Ordinarily, this problem has only 
been posed for manifolds (see Bruno, 1974a, 1971, 1972a, § 10), 

Suppose that, for the system 


3,=0(Z), i=1,...,n (6) 


with 6,(0) = 0, the matrix of the linear part is triangular, and its diagonal is 
A =(A,,...,4,). Analogously with the notation of (3), let us write 


6,= 2; ¥ higZ? , i=1,...,n. 
We isolate the “resonant parts” 
6:4 = 2; > higZ® where (Q, A> =0. 
Let the formal set 
M =%Z: fi(Z) = + = f(Z) = 0} (7) 


be integral for system (6). We will say that system (6) is normalized on the set (7) 
if all the differences 6, — 6,4, i= 1,..., n, belong to the ideal Y of set (7). For 
example, the normal form (3) is normalized on any set. 


3.4. The Set 


For the normal form (3), let us define the formal set 
A=W, = Ayia , i=1,...,n}, (8) 


where a is a free parameter. We can eliminate a from the equations, and obtain 
a representation of the set «/ in the form of (7). In other words, .o is that set for 
which condition A is satisfied. 


Theorem 2. If all the eigenvalues 1; of system (1) are pairwise commensurable, 
then the set x is analytic, and there exists an invertible, analytic change of 
coordinates 

x, =n (Z) , i=1l,...,.", (9) 


transforming system (1) into system (6), which is normalized on the set oA. 


3.5. The Set #@ 205 


If condition A is satisfied, then .%/ is a full neighborhood of the stationary 
point and, in accordance with theorem 2, system (1) can be normalized through- 
out the neighborhood; this also follows from theorem 1. 


Example 1. Let system (1) have two variables x, and x, and a small parameter 
g (that is, é = 0); also, let 4, = —A, =i. Then, in accordance with theorem 4 of 
§ 1, the normal form is 


E=0; Y= VgleVy)=y. fale. 
Let us write 9 = y,y2. The set 
AB ={ yi yi W=Aya jaly=DrP seh, 

where 

a'={y,=0}, a> =f{y,=0}, 

Af? = {8, V1, 92! G1 + 92 =0} . 
sf and .f? are the coordinate axes. By theorem 2, the set .o/ is analytic. If 

ge =A top t+ bet, 
then the component set .9/° is defined by the equation 
PtG2B=, +e)pt+(h + het =0. 


In the general case, b, + 6, 40, and, by the implicit function theorem, this 
equation has unique solution ¢ = é(p); that is, the component set .7? is a mani- 
fold. For a given real system (1), it includes periodic solutions; the passage of ¢ 
through the origin results in a bifurcation of these solutions (compare with 
Arnol’d, 1972, § 5.6; Pyartli, 1972; Leontovich and Tareyev, 1972; and Tareyev, 
1973). This is a Hopf bifurcation. 


3.5. The Set # 


In theorem 2, the requirement of pairwise commensurable eigenvalues 4,,..., 
2, excludes the appearance of small denominators <Q, A). If these denominators 
can be arbitrarily small, then the set ./ is not necessarily analytic (this was shown 
for manifolds by Bruno, 1974a, 1974b). 

Let L = {2,,...,4,} be a diagonal matrix. We will examine, on the set #7, the 
matrix 


where a is the parameter which appears in the equations (8) which define the 
set &. We define the formal set # as that subset of »/ on which the matrix B is 
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nilpotent. That is, 
B={Y: Ye xo, B"=0}. 


Theorem 3. If the eigenvalues 1,,..., 1, of system (1) satisfy condition w, then 
the set B is analytic, and there exists an invertible analytic transformation (9) which 
takes system (1) into a system (6) which is normalized on the set B. 


3.6. Properties of the Sets o/ and B 


We assume for simplicity that the norma! form (3) is analytic. Let us examine 
the properties of the solutions of this system in the sets of and &. 

1) All solutions in Re. are conditionally periodic (including periodic and 
constant solutions). In fact, the value of the parameter a is constant on every 
solution, and we have 

y,=yeexpdat, i=1,...,n. (10) 


2) Let # be the coordinate subspace 
Yn’ +1 =o =)n =O0(n' < n) . 


Let us denote by r = r(#) an integer such that among 4,,..., A, there are r 
numbers which are linearly independent over the rationals, while any r + 1 of 
the A,,...,A,- are linearly dependent. 

Then any solution (10) on the set Re(/ ™#), which does not lie in any 
smaller coordinate subspace is conditionally periodic with r basic frequencies 
(O<r<n’). 

3) For a conditionally periodic solution (10) in the set /, the basis of 
frequencies is entirely determined by the value of the parameter a. But the 
eigenvalues can be arbitrary. 

4) For a conditionally periodic solution (10) in the set @, the value of the 
parameter a determines the eigenvalues as well as the basis of frequencies. This 
property distinguishes the set Z from &. 


3.7. A Refinement of Theorem 2 


Let us write 
A= U (POH), 
(<1 


where the union is taken over all coordinate subspaces % (in Y coordinates) for 
which r(#) < 1. 


Theorem 4. The set o is analytic for system (1), and system (1) can be 
normalized on that set by an analytic transformation (9). 
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This is, clearly, an extension of theorem 2. We note that all formal families 
of periodic (r = 1) and constant (r = 0) solutions are found in the set o. By 
theorem 4, these families are always analytic. The set 7 cannot contain formal 
families of periodic solutions with r = 0 only (see Bruno, 1974c). Thus, under 
condition w the analytic set «J U & exists for system (1). 


Example 2. We consider the Hamiltonian system with m degrees of freedom: 


oh 
oh ee j=l,....m, (10’) 


x= x; 
j ’ ‘jtm , 
OXj4m Ox; 


where the Hamiltonian A is analytic at the origin, and its expansion begins with 
quadratic terms. The eigenvalues come in pairs of pure imaginary numbers with 
opposite sign: 


The normal form (3) is likewise a Hamiltonian system 


. oh . oh SI (11 

“u.=—, = —— ; = Byicsg hs 

* 00; du; a 
where the series h contains only resonant terms. Let «,, ..., a, be linearly 


independent over the rational numbers. Then in the normal form (11) we have 
that 
h = h(p1,---. Pm) » where p; = u,v; , j=l1,...,m.. 


We therefore have 


f oh oh 
8 =<U,V:u;,— = dua, v;— = A,v,a,f=1,...,m). 
1 J 6p; yd ” dp; os J } 

Let us examine the set . in the Cartesian coordinates P = (p,,...,p,,)- Then 
every coordinate subspace (in P) contains one component of the set .% which 
does not lie in a smaller coordinate subspace. Consequently, the set .o consists 

m! 
d'(m — d)! 
components which lie in d-dimensional subspaces of the P coordinates, for each 
of which r = d. In particular, there is one component, 


of 2" — 1 such components; of these, for every d < m, there are exactly 


PY ee 
= <P:— =A,a,j=l,...,m>, 

‘ ap; oe 
which is situated outside of the coordinate subspaces. On this component, r = m, 
and B? = 0. In the general case, the matrix B is not nilpotent on coordinate 
subspaces, so that @ = f°. Moreover, the m components 


em orn Differentia quations 
A, ={P:p,=0,j Fi}, i=l,....m, 


are the axes in the P coordinates and have r = 1; taken together, they comprise 
the set .o/. By theorem 4, system (10’) has m one-parameter analytic families Re A; 
of periodic solutions (these are the families of Lyapunov, 1935a). If the numbers 
a; satisfy condition w, then by theorem 3 the component .o/° will also be an 
analytic set. If this component has a real part, then it is a one-parameter family 
of m-dimensional invariant tori with a basis of frequencies «,a,...,0,,@ and with 
zero eigenvalues. As a — 1, the tori of this family shrink into the fixed point 
xX =0. 


3.8. The Part of the Spectrum not on the Imaginary Axis 


Suppose now that the system 
xX; = 9{X) , i=1,..,k+lim, (12) 


analytic in the neighborhood of the fixed point X = (x,,.-.,Xz4i4m) = 0, has / 
pure imaginary eigenvalues (A,,...,4,), k with negative real parts (K,,...,«,), and 
m with positive real parts (4,,...,,,). Then system (12) has three formal integral 
manifolds: W_ (corresponding to k,,..., Ky, 4,,--.; 4,), Wy (corresponding to 
Ags ees Aty Mis -+-> Um) and ¥Y = W_ OW, (corresponding to A,,..., 4,). 

By means of a formal invertible change of variables X > Y, U, V system (12) 
is reduced to the semi-normal form 


Y= W(Y)+ WY,U,V) , 
U = O(Y,U) + O(Y,U,V) , (13) 
V=Z(¥,V)+Z2(¥%,U,V), 


where YY, U,0) = ¥Y,0, V) = 0, O(¥,0) = O(¥,0, V) = O(¥,U,0) = 0, Z(¥,0) = 
Z(Y, U,0) = ZY, 0, V) = 0. That is, the manifolds W_, W,, and ¥ are the co- 
ordinate manifolds {V = 0}, {U = 0} and {U = 0, V = 0}, respectively. In addi- 
tion, on the manifold ¥, system (13) induces the system 


Y= WY), 
which is a normal form, while on the manifold W_ system (13) becomes 
eV) SOs 5 


which is a semi-normal form in the sense of Bruno [1971, 1972a, § 9] (as is the 
system induced on the manifold W,). 

Now, just as we did above, we will define formal sets x, 4, and of on the 
manifold ”. Theorems 2-4 remain applicable to these sets if we substitute 
semi-normalization instead of normalization. Moreover, for every formal set 4 
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in the manifold ¥, we define a set .@_ on the manifold W_ as a set in Y, U 
coordinates for which Y e€ @ and the U coordinates are arbitrary. Similarly, we 
define a set #, ¢ W,. It turns out that the sets &_,./,, B_, B,, S_, A, are 
analytic whenever the corresponding sets ., 4, and of are (see Bruno, 1974c, 
1975a). In the sets Re .o/_ and Reo, hie solutions which asymptotically approach 
tori in the set Re.e as t > +00 and t > —oo, respectively. By this means we 
obtain invariant sets which are fibers on “tendrilled tori”. 


3.9. The Neighborhood of a Torus 


Suppose that for a system of ordinary differential equations there is an 
irreducible invariant j-dimensional torus 7, filled with conditionally periodic 
solutions (j > 0). Let the system be analytic in some neighborhood of this torus. 


Problem 1”: In the neighborhood of the torus 7, find all invariant analytic 
sets containing the torus. 

We will assume that the system on the torus and the variational system are 
reducible. Then the torus 7 has a frequency basis and a set of eigenvalues x,, 4;, 
Ke 

If j = 0, then the torus 7 is a fixed point; the corresponding problem was 
discussed in previous sections. 

If the given torus 7 is not a fixed point (j > 0), then it is either a periodic 
solution (j = 1) or an irreducible torus with conditionally periodic solutions 
(j > 1). 

In the first case, the variational system is always reducible, and eigenvalues 
exist. Therefore, normal and semi-normal forms exist, and the constructions of 
sets J, B, and & are easily extended. The corresponding extensions of theorems 
3 and 4 are also valid (see Bruno, 1974c, 1975a). In particular, every formal family 
of periodic solutions which adjoins the given periodic solution 7 is analytic. 

In the second case, the reducibility of the system on the torus 7 and the 
reducibility of the variational system are additional restrictions (on topological 
obstacles see Bylov, et al, 1977). After the frequency basis and eigenvalues are 
found, the construction of the semi-normal form and the selection of the sets ./ 
and # can proceed as above. An extension of theorem 3 is valid (see Bruno, 1974c, 
1975Sa). 

In Bruno [1974c], it is shown that the periodic solutions found by Poincare 
[1971], Lyapunov [1935a], Siegel [1956] and others lie in the set .o/; the condi- 
tionally periodic solutions found by Kolmogorov [1954], Arnold [1961], Moser 
[1962, 1967], Bogolyubov [1964], and others lie in the set &. 


§4. The Normal Form and Methods of Averaging 


4.1. Local Coordinates 
Let the autonomous system 
dS/dt = 8 = F(S) , (1) 
where the vector S is (S,,..-,S:4m+n), be defined and analytic in a domain @ of 
the complex space C'+™*" and let it be real in ReY. In ReG, let system (1) have 
an invariant analytic manifold .4 of dimension / + m, which is fibered into 


m-dimensional invariant tori 7™. Local coordinates for the manifold .@ are 
functions 


X = (X41,...,X1) ; Y =(Vp5-0+s Vm) > Z = (245-++52Zn) (2) 


of S, which are defined and analytic in some complex neighborhood of the 
manifold .@ and possess the following properties: (I) Z = 0 on the manifold ./; 
(ID) the equations X = const, Z = 0 define the m-dimensional invariant torus 
FJ" = {Ymod 2z}; (If) the Jacobian D(X, Y, Z)/D(S) is non-zero on .@. The 
manifold Re.@ is the fiber bundle of the m-dimensional torus 7” = {Y mod 27} 
on the /-dimensional domain # with respect to X. In local coordinates (2), system 
(1) has the form 


X = BX, ¥,Z) , 
Y = Q(X, Y) + 6% (X,¥,Z) , (3) 
Z = A(X, Y)Z + OP1(X, Y,Z) = Z(X,¥,Z) , 
where 6 = O(|Z|), k = 1,2; B = O(|Z|?); and A is a matrix. 
Basic assumption. There exist local coordinates (2) such that in system (3) 
Q(X, Y) = Q = const (4) 
and the matrix A is triangular with a constant main diagonal: 


a,(X,Y) = 0 fori < j, a,(X, Y) = 4; = const . (5) 
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That is, system (1) is reducible on all invariant tori 7™ < .M, and it has a 
unique frequency basis; moreover, the variational system is reducible and its 
eigenvalues A; are the same for every torus 7™. Here, to real coordinates S$ 
correspond real coordinates X and Y and complex values of the coordinates Z 
which are connected by the relation of reality (see section 1.9 of this chapter). We 
denote by A = (A,,...,4,,) the main diagonal of the matrix A. 


4.2. The Normal Form 
Let the function f(X; Y) be analytic in X and Y and 2z-periodic in Y, where 
{Xe H#,|ImY|<e}. (6) 
This function can be expanded in a Fourier series 
f=Y fol(X)expi<P, Y , PeZ™, (?=—-1), 


which converges absolutely on the set (6) (recall that <P, Y> = Y pi). All such 
functions form a ring, which we denote by A(X; Y). By A(X; Y)[[Z]] we denote 
the ring of formal power series 


f=VSIKY)Z2, 0<QeZ", (7) 


where Z2 = z?...z4 and Sg € A(X; Y). Every series in this ring can be expanded 
in a unique Taylor-Fourier series (also called a Poisson series) 


f= Y feg(X)Z° exp ixP, Y>. (8) 

Note that the functions 6 in system (3) allow convergent expansions of the 
form (7) and (8). 

We will now simplify system (3) by means of a formal change of local 


coordinates 
X=U+ 250° (U,V,W) , 


Y=V+ 2@(U,V,W) , (9) 
Z=W+ BUU,V)W + FOU, V,W) , 


where the series ¢\ « A(U; V)[[W]] and contain no terms free of W, the €°) 
contain no linear terms in W, and the matrix B is triangular with zero diagonal. 
Let transformation (9) carry system (3), (4) (5) into the formal system 


U = 0 "U,V,W) , 
V=2+ 02(U,V,W) , (10) 
W = C(U,V)W + OP%'(U-,V,W)=¥, 


where the 6!", 6{?, and yj; all belong to AU; V)[[W]], the 0 have no terms 
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free of W, the @® have no linear terms, and C is a triangular matrix with constant 
diagonal A. We write 


9 =¥%/w, => Gjpq(U) W2 expi<P, V> > JHl,..., n. (11) 
Here, P runs over Z”, while Q runs over the set 
Nj = {Q:Q€Z",Q + E,>0, 1OI > 0} ; 


we write N = N, u--- UN, (as we did in sec. 1.2). We will use the usual expan- 
sions for 0 and 0): 


Oo =F Of (U)WlexpiKP,V), k=1,2. (12) 


We call the formal system (10) a normal form if, in expansions (11) and (12), the 
only non-zero coefficients g;pg and Og are those for which the indices P and Q 
satisfy the equation 

iKKP,Q>+<Q,4> =0. (13) 


Restriction 1. For every Q € N, as|P|— 
lim | P|" Ini P,Q) + <Q,A>| 20, (14) 


where the limit infimum is taken over those P € Z™ which do not satisfy equation 
(13). (See Bruno, 1975a.) 


Theorem 1. For the system (3), (4), (5) under restriction 1, there exists a formal 
transformation (9) to the normal form (10). 


The advantage of the normal form (10) over an arbitrary system (3) lies in the 
fact that expansions (11) and (12) contain only resonant terms. Hence, the 
solution of system (10) reduces to integrating a system of lower order (see sec. 
2.2). The normal form (10) and normalizing transformation (9) are not uniquely 
determined by the original system. The normal form (10) for a manifold .@ is a 
generalization of the normal form for a fixed point (§§1-3 of this chapter). 
Properties a)—d) below are the corresponding generalizations: 

a) The normalizing transformation preserves the relation of reality between 
the local coordinates (section 1.9, Chapter ITI). 

b) If one of the equations of the system (3) has the form & = const., then the 
corresponding coordinate (x,, or y,, or z;) is unchanged under the normalizing 
transformation. Since a parameter satisfies the equation « = 0, parameters are 
unchanged by the normalizing transformation (see section 1.6). Here parameters 
may appear in any of the coordinates X, Y, or Z. A second consequence is that 
the time does not change in the normalization of non-autonomous systems since 
it satisfies the equation f == 1. 

c) As a rule, the normalizing transformation diverges in any neighborhood 
of the manifold .# (Bruno, 1971, 1972a), but it may converge on some sets defined 
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with respect to the normal form (§3). Let the numbers 2,, ..., 4,: 
iw,, ..., iw, be commensurable (in particular, Re A, =-:- = Red, = 0). The 
normalizing transformation converges on the set 


AA, ,...54y) = {U,V,W: OY = 0, 0? = Qa, 
bj = Ajwja, j = I... n’, Wye.) =" = WW, = 0} 


and this set is analytic for the original system. In this case Re. (A,,..., Aq) 
consists of periodic solutions. The construction of an analytic set @ which 
contains conditionally periodic solutions is generalized analogously. 

d) The normal form is useful for the approximate integration of system (1) 
and the investigation of the stability of the manifold ./, especially of its formal 
stability (see Bruno, 1970a, 1976a; Nekhoroshev, 1971, 1973a, 1973b, 1977). 

e) The normalizing transformation commutes with a change of local co- 
ordinates on .@ of the form X = X(X), Y = C + DY, where X is an analytic 
function with non-zero Jacobian, C is a constant vector, and D is a unimodular 
matrix. 

f) The normalizing transformation likewise commutes with the operation of 
restriction of the manifold .@ to a submanifold .@’. That is, the normal form on 
MM’ can be obtained from the normal form on .@ by means of a corresponding 
specification of local coordinates. 


4.3. Calculating the Coefficients of the Normal Form 


Calculating the coefficients of the normal form (10) may be quite cumbersome. 
We will see, however, that it may sometimes be simplified. We write out the 
expansions of the right-hand sides of system (3): 


OY = PS D(X)Z2expiKP,Y>, k=1,2, 
Sp = G2, =D Sirol X)Z%expiKP,¥> , j= ly...n. 


Let K be an n-dimensional vector such that, for every non-zero coefficient in the 
expansions (15), <Q, K> > 0 and, additionally, if <Q, K> <s, then i¢P,Q> + 
<Q, A> = 0, where s > 0 is some fixed number. That is, system (3) is normalized 
to degree s (compare with section 1.7). 


(15) 


Theorem 2. Let a vector K be given, and let system (3) be normalized to degree 
s. Then there exists a normal form of this system in which the coefficients of the 
terms of degree less than 2s are exactly the coefficients of the corresponding 
resonant terms in system (3). That is Oyy(U) = P¥5(U) and gjpqQ(U) = fipg(U) for 
<Q, K> < 2s and i<P,Q> + <Q, A> = 0. 


We now note that the resonant part of any of the expressions (15) coincides 
with the averaging of the corresponding function along solutions of the system 
X=0, Y=, 2, = 4,2; (Jj = 1,...,n), if Rez; = 0 for ail j. Consequently, the 
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concept of “resonant part” is analagous to the concept of “averaging” but is even 
applicable to situations in which Re A # 0. On the other hand, averaging is 
defined even for non-analytic functions, for which we cannot write expansions 
like those in (15), and hence cannot identify resonant parts. 


Example 1. Let us consider a system in the standard form 
X = eF(X,t) , (16) 


where the vector function F is analytic in X and ¢ on some domain Y and is 
conditionally periodic in t with frequency basis Q = (w,,...,@,). Then system 
(16) is equivalent to the autonomous system 


X=eF(X,Y), Y=Q2, é&=0, (17) 


where F(X, Qt) = F(X,t) and the small parameter ¢ is the unique small co- 
ordinate. By theorem | and property b), the normalizing transformation of 
system (17) has the form 


X=U+ > &H,(U,V) , Y=V, e=eé. 
Ke1 


Here, the solutions of equation (13) are P = 0 and Q = q, an arbitrary integer. 
The normal form 


Os 5 2'G (UV); Vs, S820 


is “averaged over time”. By theorem 2, with K = 1 and s = 1, the function G, is 
the resonant part of the function F(U, Y) or, equivalently, it is the time-average 
of the function F(U, t). Thus, the asymptotic method of Krylov-Bogolyubov (see 
Bogolyubov and Mitropol’skii, 1974) for analytic standard systems is simply a 
special case of the normal form method. The reduction of the system to standard 
form is, in fact, just the transformation to local coordinates. In §2 of chapter V, 
we will discuss the applications of this method in mechanics. 


4.4. A Hamiltonian System 


A Hamiltonian system is distinguished by the fact that it is defined by a single 
Hamiltonian function and by the grouping of all variables (except parameters) 
into canonical pairs. Local coordinates for the manifold .@ also form canonical 
pairs, so that system (3) is also Hamiltonian (see Nekhoroshev, 1972), The Y 
coordinates all belong to different pairs. The Hamiltonian function h of system 
(3) can be expanded in a series 


h = hpg(X)Z2expicP, Y> . (18) 
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The normalizing transformation (9) can be taken to be canonical, so the normal 
form is again a Hamiltonian system; its Hamiltonian is 


h=Y gpg(U)W2expic¢P,V> , (19) 


which contains only resonant terms (as defined by equation (13)). Properties a)—f) 
continue to hold here. 

Let the Hamiltonian (18) depend on one small parameter p and have the form 
h = ho + wh,, where the expansion of hy contains only resonant terms. Then by 
theorem 2, the Hamiltonian of the normal form (19) may be written as 


h = hy + ulh,] + O(n’) , (20) 


where [h, ] is the resonant part of the “perturbation” h,. 


Example 2. The planar circular restricted three-body problem is a Hamiltonian 
system with two degrees of freedom and one parameter y. When p = 0, the system 
is integrable; its phase space contains a one-parameter family of circular motions 
and a continuum of three-dimensional manifolds, corresponding to elliptic 
motions with a fixed frequency basis (see Bruno, 1972c). For circular motions, 
Re 4, = 7" isa cycle (periodic solution), and local coordinates are the second 
system of Poincaré elements. For elliptic solutions, Re. 4, = 7? x #, where # 
is the interval 0 < e < 1; local coordinates are Delaunay elements or the first 
system of Poincaré elements with constant displacement. In such coordinates, 
system (3) is a normal form for u = 0. Thus, for small y, the Hamiltonian of the 
normal form is as in of (20). The Delaunay method (in Krasinskii’s interpreta- 
tion, 1973) corresponds to the reduction to normal form in a neighborhood of a 
cycle .@, (for circular motion) or of a torus 7? in the manifold .4@,. The method 
described above allows us to perform the normalizing transformation at once in 
the neighborhood of the entire manifold ./@, for all values of the eccentricity, 
O<e<l. 


4.5. On Problems in Mechanics which Can Be Solved with the Aid of the 
Normal Form 


Since, as a rule, the normalizing transformation (9) diverges in any neighbor- 
hood of the manifold .@ (see Bruno, 1971, 1972a), the correspondence between 
solutions of the given system (3) and its normal form (10) demands a special study. 
The results which have been obtained in this direction can be classified as follows. 

1) On the stability of the manifold .@. 

a) The asymptotic stability (or instability) of system (3) can be deduced from 
the asymptotic stability (or instability) of system (10). Much work has been done 
here, all of it summarized in hypothesis 2 of Bruno [1970a] (see, for example, 
Starzhinsky, 1972, 1973, 1974, 1977). 
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b) The neutral stability of system (10) implies the formal stability of the system 
(3) (see Nekhoroshev, 1971, 1973a, 1973b, 1977) and in some cases Lyapunov 
stability (see Moser, 1962, 1968). Results of this type were used, for example, to 
investigate the stability of the Lagrange solutions of variants of the restricted 
three-body problem (Markeev, 1973, 1978). 

2) Estimating the effects of the instability of system (3) with the help of the 
system (10). Here, for example, we might calculate the influence of nutational 
oscillations on the drift speed of a heavy gyroscope in a Cardan suspension (§ 1, 
ch. V); or one might examine the mechanism of gap formation in the asteroid 
ring (Bruno, 1970b). 

3) Determining periodic and quasi-periodic solutions of the system (3) with 
the help of the system (10) (see §3 of this chapter). Here, we can find new 
(asymmetric) solutions to the problem of oscillations of a satellite in the plane 
of an elliptic orbit, using the asymptotic method of Krylov-Bogolubov according 
to example | (see §2 of ch. V). The computer calculations for example 2 allow us 
to find all asymmetric periodic solutions of the second kind in the restricted 
three-body problem (Bruno, 1976c). 

The exposition of § 4 follows that of Bruno [1976a, 1976d]; for generalizations 
see Bruno [1979]. 


Chapter IV 
On the Newton Polyhedron 


§1. A System of Differential Equations 


1.1. Introduction 
We will examine the system 
x; = 0X) , i=1,...,n (1) 


in a neighborhood of the point X = (x,,...,x,) = 0, assuming that the functions 
gy, are analytic in this neighborhood. Our presentation will closely follow that of 
the two-dimensional case (Chapter II), but without proofs. The point X = 0 is 
simple if at least one of the functions ¢; is non-zero at X = 0, and singular if 
(0) = 0 for all i. This singular point is called elementary if the eigenvalues 2,, 
., 4, of the matrix A of the linear part of system (1) do not ail vanish. If all the 
A; are zero, then the origin is a non-elementary singular point. 
We will investigate the behavior of the integral curves of system (1) in a 
neighborhood of the origin by introducing new coordinates Y, 


x, = EY) , é(0)=0, i=l,...n, (2) 


in which system (1) takes the simplest form 


W=WAY),  i=leyn. (3) 


We will assume that these functions ¢; are at least smooth, and that the Jacobian 
of transformation (2), D(=)/D(Y), does not vanish at the origin. That is, the 
change of coordinates (2) is invertible. In simpler cases, such a change of coordi- 
nates exists in the entire neighborhood of the origin. In more complicated 
situations, we will have to divide the neighborhood into several parts, %(e), 
and make an appropriate change of coordinates in each part. 

In what follows, our attention will be principally focused on the situation 
n = 3. In those discussions in which n is arbitrary, it would, perhaps, be advisable 
to assume that n = 3 on first reading. 
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1.2. Simple Points and Elementary Singularities 


Theorem 1. If the origin is a simple point of system (1), then there exists a 
change of variables (2), such that, in the new coordinates, system (1) takes the form 


y=l, y,=0, i=2,...,n. 


This theorem is similar to theorem 1 in § 1, chapter II, and has an identical 
proof. It means that exactly one integral curve Yy passes through the origin, 
while the remaining integral curves in the neighborhood of the origin are almost 
parallel to the curve Fo. 

If the origin is a singular point of the system (1), then a linear change of 
coordinates can take the matrix of the linear part into Jordan form. We will 
assume for system (1) that such a transformation has already been applied, so that 


X;, = AX; + 6,Xj-1 +... , i=1,...,n, 


where o; = 0 if J; # 4;_,, and o; = 0 or 1 if A; = A;_,. We introduce the vector 
A =(Ay,...,4,), vectors Q = (q;,---54n), and the notation X2 = x#'...x4. We 
write system (3) in the form 


W=VG=Vi Y GgY?, i=l,....n. (4) 
QeN; 


Since the y,g,(Y) are series in non-negative integral powers of the variables, then 
N; is a set of integral points Q = (q,,..-,4,) for which q, > —1 and q, > 0,j #i. 
We write N = N, u':*UN,,. 


Theorem 2. If the origin is an elementary singular point of system (1), then there 
exists a formal change of variables (2), which takes system (1) into a system (3) for 
which (in the notation of (4)) gig = 0 if <Q, AD # 0. 


That is, system (4) includes only resonant terms y,gig Y°, for which 
(Q,4>= > GA =0. (5) 
i=1 


We call such a system (4) a normal form. This theorem is analogous to theorem 
2 in §1, Chapter II, and the proof is identical (see § 1, Chapter ITI). 

If the original system is real, then for every complex eigenvalue 4, its complex 
conjugate 1 also appears among the numbers 4,, ..., 4,. Let 2,, ..., Ag; be 
complex eigenvalues, 4,., = 2,,..., Ag, = A; and Az)4,,..-, 4, be real. Then we 
can introduce complex-conjugate coordinates x,,, = xX, and apply a normalizing 
transformation which preserves that conjugacy. (Le., yj, = V1,--+, Yor = Y, are 
complex-conjugate variables, while y,,,,,..., ¥, are real; see sect. 1.9 of ch. IID). 

Let 5 be the number of linearly independent integral vectors Q € N which 
satisfy equation (5): we will examine several cases. 
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1) 6 = 0. In this case, the normal form has the form 
¥,= 4; 5 CA, sa 9ns 


Its solutions are y; = c;exp4,(t — to). The normalizing transformation does not 
always converge in this case (see Bruno, 1971, 1972a), but for real systems there 
always exists a smooth normalizing transformation (see Sternberg, 1958, 1959). 
An example of this case is A; = 1, 4, = fd, A= J3. 

2) 6 = 1. Let a vector REN satisfy the equation <R, A> = 0, and let the 
greatest common factor of its components r,, ..., 7, be unity (see sect. 2.3 of ch. 
II). 

If R > O, then solutions Q € N of equation (5) are vectors Q = kR, where k is 
a non-negative integer. The normal form therefore has the form 


Yi=Ay + vGlY) , g, = Y 9, ¥"® , i=1,...,n. (6) 
k= 


Here the series g; depend only on the product Y*%. We shall make this a new 
independent variable. Letting S, = R, we choose n — 1 integral vectors S,,..., 
S, such that the determinant of the matrix, whose columns are S,,..., S,, is unity. 
We then apply the power transformation 


Z= y* , B= lesan « (6’) 


Then system (6) is transformed into the system 
(In z,) = g} => Gees PS Lain (7) 
where gig = <R, A> = 0. System (7) is easily integrated to 


Inz; =c¢,; + jaeeet Z 
1 2,9;(2,) 

As a rule, the normalizing transformation diverges (see section 3.2 of Chapter III 
and Bruno, 1971, 1972a), but in many cases there exists a smooth normalizing 
transformation, for example, if Re 4; 4 0 for all i (see Sternberg, 1958, 1959), 
However, even when n = 3 we find cases when there is no smooth normalizing 
transformation. An example is 4, = 0, 4, = 1, 4; = ~./2 with g, =0 but 
92/93 # 42/3 in the normal form (6). 

If one of the coordinates of R is —1(r, = —1, say), then the normal form has 
the form 

Vi = Ai» i=1,....n-1, 


Yn = Ann + ayy... Vat a 


This system is easily integrable. 
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3) 6 = 2. Let R! and R? be two integral vectors which form a basis of the set 
of integral solutions of equation (5); that is, the greatest common divisor of all 


the minors of the matrix 
R} = ae eee 7) 
R? PR ote eae 


is unity. Then there exist integral vectors S3,..., S, (see Bruno, 1965, as well as 
sect. 2.1, ch. III) such that the determinant of the matrix with columns R?, R?, 


S;, ..., S, is unity. Then the power transformation (6’), where S, = R' and 
S, = R?, takes the normal form into the system 


(In'z;) = gi(z1,22) ’ D= liad His 
A subsystem for the first two variables can be separated from this system 
(Inz,) = 921,22), i=12. (8) 
If we find z,(t) and z,(t), then the remaining variables can be found from 
Inz,;=Jgjlz.(0,2.()) dt, =f =3,...,0. 


We must examine subsystem (8) in a neighborhood of the origin z, = z, = 0, 
which is a non-elementary singular point. We already know how to solve such 
a problem (see §3 of ch. II). To do so, we divide the neighborhood of the origin 
into sets %(e), in each of which we then introduce a new set of variables in 
which system (8) is integrable. In these cases transformation (2) diverges as a rule. 
We have to seek a smooth transformation to an integrable system in sets 
corresponding to the sets UE) of system (8), rather than in the entire neighbor- 
hood of the origin. But situations may arise in which such a smooth transfor- 
mation exists only in some of these sets, not in all of them. Then a smooth 
normalizing transformation does not exist. If n = 3, then 6 = 2 only if all the 4, 
are real and pairwise commensurable, or else 4; =0 and 4, =A, are pure 
imaginary. 

4) 6 > 3. Then we can find integral solutions of equation (5) R',..., R°, which 
can be supplemented with integral vectors S;,,,..., 8, such that the determinant 
of the matrix with columns R',..., R®, S;4,,.-., 5, is equal to unity. Then the 
power transformation (6’), with S; = R', i= 1, ..., 6, takes the normal form (4) 
into the system 


(In'z;) = gi(Z1,---525), t=ly..n. (8’) 


(see sect. 2.1 of ch. IIT). From this system, we can separate out an independent 
subsystem 
2; = Z:9)(Z1,--+525) » i=1,...,6 (9) 


of 6 equations. The remaining n — 6 variables can be found from the first 6 by 
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quadrature. We must investigate system (9) in the neighborhood of the point 
Z, = ++: =z, = 0, which is a non-elementary singular point. If 6 > 2, this is a 
difficult problem. Thus, we arrive at the problem of examining non-elementary 
singular points. The transformation to normal form uses the linear terms of the 
original system to reduce the order of the system from n to 6 <n; but if 6 > 1, 
we must study further this system of order 6. 


1,3. The Generalized Normal Form 
Let R,, R>,..., R, be vectors such that, for some vector T, we have 
<R,T> <0, rie) Perey aa 
Then the set of points 
V={0:0=8B,R, +°*' + BR, B > 0} 


is a polyhedral convex cone. We will call the series f = bt foX® a series of the 
class ¥ if all the exponents Q in the series are integral vectors that belong to the 
cone V. 

Consider the system 


(Iinx,) =f(X)=VfigX®, i=l..n, (10) 


where the /; are series of class ¥; let us write A = Fy = (fio.---, fro). If the series 
f, converge, they do so on a set of the form 


Uy = {X:|X|F <ei=1,..., k}. 


Theorem 3. If, in system (10), the f; are series of class V, then there exists a 
formal change of variables (2), where the y;'€, are series of class WV, which takes 
system (1) into a normal form (4) in which the g; are series of class ¥V and gig = 0 


if <Q, A> #0. 
This is an analog to theorem 1, §2 in chapter I; their proofs are identical. 
The set Wy consists of curvilinear cones which abut the origin. 
1.4. The Polyhedron and the Normal Cones 


Let D be a set of points Q = (q;,...,q,,) in the n-dimensional real space Rj, 
and let R3 be the dual space to Rj. We now consider the following problem: for 
every P € R5, find a subset D> of D, such that 


<Q,P> =rforQeD,, 
<Q,P><rforQeD\D, , 
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where 
r=r(P) = max <O,P) . 
@eD 


In order to solve this problem, we first note that, for a fixed vector P, the 
equation 
<Q, P> = co = const (11) 


defines a hyperplane L in R{ which divides that space into two half-spaces: 
positive L™ = {<Q, P> > co} and negative L™ = {<Q, P> < co}. The hyper- 
plane (11) is a supporting hyperplane for the set D if its positive half-space has no 
points of the set D and if for any hyperplane <P,Q> =c < cp, its positive 
half-space contains points of the set D. We will denote by Lp the supporting 
hyperplane which corresponds to the vector P. Its negative half-space Lt” is the 
supporting half-space for the set D. Clearly, Dp = Lp D. 

In order to describe the sets D, for different vectors P, we need to consider 
the convex hull 4 of the set D: 


4= {0:0 =6,0, +--+ 6,0,,Q;€ D,d, > 0,5 6; = 1} . 


We denote by I the intersection of all the supporting half-spaces LY” of the set 
D. It can be shown that the set J coincides with the closure of the set 4 (see 
theorem 1, §1, ch. I). We call the intersection of the set / with the supporting 
hyperplane L, a face. The boundary of the closed set F consists of faces of 
different dimensions (a vertex is a face of dimension zero, an edge is a one- 
dimensional face, etc.). We will denote each of the faces of F by 15, where d is 
its dimension and j is its number. We write D® = Do ®. If 5° =LeoS, 
then, clearly, Dp = D{; that is, all the boundary subsets D> are the subsets Di” 
which lie on the faces of I. 

For some fixed /;, let us consider the set U/® of vectors P € R43 for which 
LpaT = J. This set 


p.<2:P>=<Q",P>, Ge ri, Q" « ri” : 
<Q',P> > <Q,P> , ger 


is a convex cone which we call the normal cone of the face [;. 
Let us also consider the cone VW which is normal to U{*; that is 


V = {0:<0,P)<0, PeU}. 


UM = 


J 


We call this the tangent cone to the face 5. 

Under rather weak restrictions on the set D, the set "is a polyhedron and its 
boundary él consists of a finite number of faces 1. This is the case, for example, 
if the set D consists of a finite number of points (see Goldman and Tucker, 1956); 
there, both U and V! are polyhedral cones. 

Let the dimension of the polyhedron J itself be n. Then the normal cone 
Us""" of the face F""” is a ray orthogonal to F""" and directed out of I. The 
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tangent cone V{""") is the half-space bounded by the hyperplane which contains 
the point Q = 0 and is parallel to the face J'""'’. The normal cone U%"”? is a 
two-dimensional sector of the plane orthogonal to the face /;""”. The tangent 
cone V{""?) is the intersection of two half-spaces. The normal cone U}’? of the 
edge J}") is a part of the hyperplane normal to /;''); the tangent cone V$") is 
n-dimensional, and contains the line which is parallel to the edge J; and passes 
through the origin Q@ = 0. The norma! cone of a vertex is an n-dimensional 
polyhedral cone; the tangent cone of a vertex 7° = O is likewise a polyhedral 
convex cone of dimension n. It is the convex hull of the set / — O (that is, the 
polyhedron I is parallel transported such that the vertex 7° becomes the 
coordinate origin). In general, the tangent cone vip) is the convex hull of the set 
I —O, where 0 is any interior point of the face I; (that is, the point O does 
not lie on a face of dimension less than d). 


Example 1. Let n = 3 and D = {Q,,...,Q.5}, where Q, = (0,0, —1), Q, = 
(0, = 1,0), Q; = (- 1,0, 0), 04 = (0, 0, 0), QO; = (1,0, 0), 06 = (0, 1,0). The poly- 
hedron J is pictured in figure 82. Its boundary OF consists of five vertices, eight 
edges, and five faces. The normal cone corresponding to the face /’{?) with vertices 
01, Q3, Qe is 
U2) 2 de. (01, P> = (Q3,P> = (Q6,P> At 
U"<Q1,P> > <Q2,P> 5 
or 
UP = {P: —p3 = —P, = P2 > —P2} » 
which is the ray 
Ui = e(-1,1,—-1) , c>0. 


The tangent cone V~ is the half-space <Q, P, > < 0, where P, = (—1,1, —1). That 
is, 
VP = {2:41 —42+4320} . 


The normal cone of the edge I’{’) with vertices Q, and Q, is 
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yor a Sp. Qu P> =<QeP>, Oi, P> > <Os,P> } 
" " U"<Q1,P> > <Qs,P> 
or 
UP = {P: —ps = pa, — Py < —D3,P1 < —P3} - 
This is a two-dimensional angle which lies in the p, = — p, plane and is bounded 


by the rays spanned by the vectors P, and P, = (—1, —1,1). The tangent cone 
V{!) is the dihedral angle defined by the inequalities 


2.P><9, <O,P,><0. 
That is, 


VP = {0: —41 + 42 ~ 43 <9, — 4, — G2 + G3 <0} . 


The normal cone corresponding to the vertex '{ = Q, is drawn in figure 83. 
As can be seen, the normal cone of a vertex is bounded by the normal cones of 
the edges which meet at that vertex. In formal notation, 


U = (P: (01, P) > OQ), P>.j = 2,3, 5,6} « 


Thus, for n = 2 or 3, the faces { and their normal cones U!” can be found 
from the corresponding drawings. But if n > 3, the boundary subsets D(® and 
their normal cones can be found from linear relationships among the vectors 
Q € D (see Bruno, 1973b, Soleev, 1983). In particular, the faces [5 and their 
normal cones U" can be found knowing only the vertices of the polyhedron I. 
Moreover, the normal cone U and the tangent cone V™ can be described using 
just the edges which lie on or are adjacent to the face J}. Let the edges I",..., 
Ti lie in a face 15, and let edges UY}, ..., H) adjoin it. We denote by R, the 


unit vector along an edge /;{"’, so that the vectors R;4,,..., R,, are directed out 
of TF; along edges J}, ..., Hi), respectively. Then the normal cone is 
(Ry, P> =0, T= lyaaalhs | 

UM = <P: ? ‘ 12 
J | (Ry P> <0, ka=ltth...ym| (12) 

P3 

P2 
0 


P| 


Fig. 83 
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and the tangent cone is 
VO = {0:0 =yRi +--+ WR + Or Riar + 77° + Om Rms Se S OF - 


The normal cones of different faces do not intersect, and their union constitutes 
the normal cone U of the entire polyhedron 7. For example, if the set D lies in 
the positive orthant Q > 0 (e., quadrant, octant, and so on), then the cone U 
contains the negative orthant P < 0. 

Thus, in order to identify the boundary subsets D and their normal cones 
U, we must examine the polyhedron I, identify its faces J, and construct for 
each of these a normal cone U™. 

For a sum 

f(X) = Lfox? (12’) 
the set 
D = D(f) = {Q:fo 4 0} 


is called the support. The closure of its convex hull is / = I'(f), the Newton 
polyhedron. To each face [{ of F there corresponds a truncation 


f= ¥ fox? 
Qe Di? 
with respect to the vector order P € U™. 
We denote by ¥; the class of power series (12’), the supports of which lie in 
the tangent cone V‘. Together with the normal cone (12), we define for small 
é > 0, the set 


Ine <¢R;,P> < —Ine , P= neh 


{ 
Ue = P: » 
pe (Ry P> <ine , k=14+1,...,mJ 


(13) 
If d = 0, then the set U{(e) is the set of values P = In|X| for which a series of 
class ¥; may converge. 
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We consider a system (10) in which the x; f,(X) are analytic functions, and the 
eigenvalues j; are all equal to zero. We place each vector Qe N in corre- 
spondence with a vector coefficient Fg = (fig,---, fag): We now examine the 
support D of the series ¥ FyX®, that is, the set D(F) of vector exponents with 
non-zero coefficients Fo: 


D = D(F) = {Q: Fp #0} . 


Let [ = I'(F) be the closure of the convex hull of the set D. Generally speaking, 
Tis a convex polyhedron (see Goldman and Tucker, 1956, Bruno, 1965, 1973b). 
It is the Newton polyhedron of the system (10). The boundary @7 of I consists 
of vertices, edges, faces, etc. We will denote all of these faces of varying dimension 
by 7, where d is the dimension and j is the number (see Bruno, 1973b). Thus 
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Fig. 84 


the J} are vertices, the J7'") edges, and so forth. For example, for the system 
X =x? + 5x?x,x3 = X4(X, + 5x, X2X3) , 


Xa(X2 + 4x,x2%3) , 


ll 


X_ = xR + 4x, x3x5 
X3 = H3 + 3x, X2xZ = x4(x3 + 3X, X2x3) , 
Q, = (1,0,0) > 2 = (0, 1,0) > Q; = (0,0, 1) ? 0, = (1,1, 1) > 


Fisa tetrahedron with four vertices °, six edges Jj”, and four faces J; (figure 
84). 

For a vector P, the supporting hyperplane L, of the set D is defined by the 
equation 


<Q, P> = max <Q;,P> . 
Q;eD 


The supporting hyperplane Lp intersects the polyhedron I at some face 5. 
Since we are interested in the neighborhood of the origin, we need to identify 
those faces J; which lie in the supporting hyperplanes Lp with P < 0. Let this 
be done. On every edge /;) we find a unit vector +R;, which is uniquely 
determined (except for sign), as the difference between neighboring integral points 
on the edge Jj"). Now let 5 be a face of some dimension 0 < d < n. Suppose 
that the edges '{), ..., lie on this face (if d = 0, there are none), and let the 
edges [),,,..., 1, adjoin the face. We choose the vectors R,,..., R, with arbitrary 
direction (sign), but select the vectors R,,,,---, R,, so that they are directed out 


of the face 5. We place in correspondence with 1 a set in X space: 


jx Se gig T= Logs } 


Me) = 2X: 
a) [X|Fk <e, k=14+1,...,m 


This is the set of all X for which P = In| X| belongs to U!(e) (see expression (13)). 
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Taken together, all of the sets %(e) (for any positive e < 1) form an entire 
neighborhood of the origin X = 0. In each of these sets %{(e), we introduce its 
own set of coordinates in which the system of differential equations becomes 
simpler. This simplification proceeds differently for different dimensions d of the 
face F. 

1) d =0. To investigate system (10) in the set %(e), corresponding to the 
vertex 1{°) = 0, we make a change of the time variable, dt, = X@ dt (reduction 
by X®). Then the vertex 1 becomes the origin, Fy = A, and theorem 3 on the 
generalized normal form applies. That is, system (10) can be transformed, under 
a change of coordinates of class HG} to a normal form; this, in turn, can be 
transformed by a power transformation into a system (8’), where everything 
reduces to investigating a system of form (9) and of order 6 < n. For this system, 
we must once again examine the neighborhood of a fixed point, but the order of 
the system has already been reduced. So we must once more construct a poly- 
hedron, divide the neighborhood of the origin into sets %” (e), which corre- 
sponds to dividing the set %{(e) into parts W\P(e). In each of the sets ¥, () 
we make a simplification of the system, and we continue the process until we 
arrive at an integrable system. 

2) d = 1. To study the behavior of system (10) in a set &@"(e), corresponding 
to an edge J}, we must perform a power transformation which turns the edge 
T;” into a vertical edge, parallel to the qj, axis. To do so, we find a set of integral 
vectors S,,...,5,—, such that the determinant of the matrix with columns §,,..., 
S,-1, R; is equal to unity (recall that R; is the unit vector along the edge J;”). 
We let y, = X*,i=1,...,0—1, y, = X*®. Then the set {(e) is transformed 
into a neighborhood of part of the y, axis, ¢ < |y,| < e 1. In order to study the 
system in this neighborhood, we must perform a reduction by the maximal 
powers of y,,..., y,—1, then find the singular points y? # 0, oo on the y,, axis and 
separately study their neighborhoods. We divide the remainder of the neighbor- 
hood of the axis into parts, in each of which theorem | on simple points applies 
and the system is integrable. All of this corresponds to dividing the set %{"(e) 
into parts. In those parts corresponding to simple points on the y, axis, the system 
is integrable; in those parts corresponding to singular points y?, we must further 
investigate the neighborhoods of such points. If the singular point is elementary, 
then we must put the system into normal form and investigate the transformed, 
lower-order system. If it is a non-elementary singular point, we must once again 
construct a polyhedron, divide the neighborhood into parts, and so forth. But 
this is exactly the problem with which we began, though the singularity is simpler 
then it was in the original X variables, since it has already been partially resolved. 

3) d = 2. Let the vectors R’ and R? form an integral basis on the face 5), 
i.e., the greatest common divisor of all the minors of matrix (7’) is unity and the 
plane spanned by R‘ and R? is parallel to the face 5”. We supplement the 
vectors R’ and R? with vectors S,,..., S, to form a basis of the entire space 
R{; we then perform the power transformation 
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y= xe, i=1,....n—2, Ya-1 = X* > Yn = XR = 


Then the face 7?” is parallel to the q,_,, g, coordinate plane. Now we make a 
reduction by the maximal powers of y,,..., y,-2. Under our power transforma- 
tion, the set &{?)(c) becomes a set Y?) (e), which is a neighborhood of the part of 
the y,_,, y, plane defined by 

E<1lynal<e", € <n] Se" 
That is, the problem of investigating solutions in the neighborhood of a singular 
point is reduced to the problem of studying solutions in the neighborhood of 
part of a plane. As a rule the y,_,, y, plane is invariant, and we should begin by 
finding solutions in that plane. We do not know how to completely solve this 
probiem, but we can divide the plane into parts, such that in each part either all 
points are simple and theorem | applies, or there is one fixed point which we can 
investigate separately. In this case, closed integral curves (for example, limit 
cycles) will be cut into pieces which will lie in different parts of the y,_,, y, plane. 
Division of this plane into pieces corresponds to division of the set %{”(e) into 
pieces. In pieces corresponding to simple points, the system is integrable; in pieces 
corresponding to singular points, we must continue our process. Since, in each 
step, either the singularity is simplified or the dimension is reduced, we must 
arrive at an integrable system in a finite number of steps. 

In principle, the local method allows us to investigate solutions in the 
neighborhood of periodic solutions. Therefore, in the division of the y,_,, y, plane 
we may consider pieces similar to an annulus, a neighborhood of a periodic 
solution. But three difficulties arise here: finding a periodic solution and normal- 
izing a system in its neighborhood are transcendental (non-algebraic) operations; 
and an analytic, periodic solution is not, as a rule, algebraic itself. In general, in 
investigating system (10) in a set Y{*(e), we can use a coordinate change (2), where 
the y;'é, are series of class ¥; (compare with section 3.10 of Chapter I and §4, 
Chapter IJ). 

4) d > 2. Let the vectors R',..., R@ form an integral basis of the face 1, and 
let the vectors S,,..., 5,_~ supplement them to form an integral basis of the entire 
space. Then the power transformation y; = X*,i = 1,...,n — d; Yp-aaj = X™, 
j=1,...,d transforms J5® into a face parallel to the q,_ai1, ..., q, coordinate 
subspace. We must then perform a reduction of the system (in Y) by the maximal 
powers of y,,..., Yaa The set Z(e) becomes a neighborhood of part of the 
Vn—dt1>+++> Yn COOTdinate subspace: 

e<|y,-ajl<e', jol,...,d. 
We divide this part of the subspace into pieces, such that in each piece either 
theorem | applies or there is one singular point. We must now investigate the 
neighborhood of each of the singular points y°_,,,,..., y2; these singularities are 
simpler than the original singularity. 
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Thus, we must resolve singularities, reducing the investigation of solutions in 
the set %{(c) (with d > 0) to the investigation of the neighborhoods of a few 
singular points. At elementary singular points, as well as in the sets %{(e), we 
can apply theorems 2 and 3 on the normal form. This, together with appropriate 
power transformations, reduces the order of the system. Thus, our procedure 
simplifies nonelementary singularities and reduces the order of the system. After 
a finite number of steps, the neighborhood of the origin X = 0 of system (10) is 
divided into a finite number of sets %{4"); @(¢), where the indices j,,...,, indicate 
repeated divisions; in each of these sets new variables are introduced in which 
the original system is integrable. 


1.6. Basic Mathematical Problems 


The method, presented in preceding sections, of locally integrating system (1) 
in the neighborhood of a singular point has three fundamental aspects: 
I. The formal-algebraic aspect. 
II. The interpretation of formal transformations. 
III. The synthesis of the different sets W{*(e) into a general picture. 
We now consider these aspects in detail. 


I. The formal-algebraic problem. We have three kinds of changes of variables: 

a) the formal normalizing transformation; 

b) the power transformation; 

c) the parallel translation. 
The question is, what is the simplest system (3) to which we can take an analytic 
system (1) using these transformations? That is, we seek all such formal systems 
(3) which cannot be simplified by the transformations mentioned and to which 
system (1) reduces. The question arises as to the finiteness of the number of 
simplest systems (3) obtained from one original system. There is also the question 
of the finiteness of the number of transformations a), b) or c) needed for such a 
reduction. In addition we must explain whether all of these simplest systems 
(3) are integrable. In the two-dimensional case (n = 2), everything reduced to 
the integration of a system of order 6 <n, ie, d =0 or 6 = 1. Such systems 
are always integrable. These simplest systems are integrable in the general case 
as well. 


II. Interpretation of formal transformations. Since normalizing transforma- 
tions are given as power series which, as a rule, diverge, the question arises of 
how to interpret these formal transformations. The problem is posed thus: With 
the help of transformations of types a), b), and c), let the original system (1) in a 
set U* be reduced to the simplest system (3). We seek to determine the properties 
of solutions of a system (1) in the set W@* from system (3). There are several possible 
approaches here—analytic, smooth, topological, formal, and others. That is, we 
can seek an analytic correspondence between the solutions of the original system 
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(1) in the set @* and the solutions of the simplest system (3). (See § 3, Chapter 
III). Likewise, we can seek a smooth or continuous transformation from system 
(1) to system (3). It is also possible to investigate the properties of solutions of 
system (1) through the properties of the solutions of system (3). For instance, if 
system (3) has no solutions which approach the singular point, then if there are 
such solutions of (1), they must approach the singular point very slowly. This 
leads to the concept of “formal stability” (see sect. 4.5, Ch. II1). By studying system 
(3), it is also possible to describe the properties of solutions of system (1) in a 
subset of the set %* of almost full measure. In my view, the approaches which 
give the best perspective are the smooth and formal interpretations. 


III. Synthesis. The method of local integration reduces to the division of the 
neighborhood of the singularity being investigated into sets 


US AO(e) , (14) 


in each of which we introduce new variables for which the system is integrable. 
It is then necessary to sew together the parts of the solutions found in these pieces 
of the neighborhood. In some cases, this is not terribly important, but in other 
cases it is fundamental to the problem. For instance, it is quite important if we 
wish to determine the stability of a singular point when n = 2. If there exist 
integral curves which approach the origin within one of the pieces of set (14), 
then the question of sewing together is not important. If there are no such 
solutions, however, we want to be able to distinguish between a center and a 
focus and discover the character of a focus. To solve this problem with the local 
method, it is necessary to divide the neighborhood into sets (14) and to find the 
solutions in each set to a high degree of accuracy. These solutions define point- 
wise transformations from one boundary of a curvilinear sector to another. Thus, 
it is necessary to go through all the sectors of all sets (14) and determine the 
direction of the displacement of a point after a full rotation around the origin. If 
(asymptotically as x? + x3 +0) this displacement is greater than the error due 
to approximation, then the origin is a focus and we know its character; if the 
displacement is less than the error, then we must calculate the solutions with 
greater accuracy. 


Remark. The local method allows us to examine the dependence of solutions 
of system (1) on some small parameter y (or several small parameters). We need 
only add the equation ft = 0 to system (1): It is then possible with the local method 
to study various bifurcations, the creation of periodic solutions, and other local 
phenomena (see sects. 1.6 and 3.4, Ch. III). 

The Newton polyhedron of a Hamiltonian function allows us to find trun- 
cated Hamiltonian systems (see Bruno, 1978a). 


§2. Other Problems Using the Newton Polyhedron 


The applications of Newton’s polygon to different areas of mathematics have 
been enumerated in a review by Chebotar’ev [1943]. Since then, the Newton 
polyhedron has appeared (Bruno, 1962), and a wide range of investigations has 
been undertaken employing methods of the geometry of exponents. In this part 
of the chapter, we present a brief survey of those results known to the author 
(excluding systems of ordinary differential equations, which have considered in 
§ 1 of this chapter). 


2.1. The Hadamard Open Polygon 


We consider a polynomial in one variable 


S(z) = ¥ a,z* (1) 
k=0 
and let J, = In|a,|; then 
f =¥ 6, exp(i, + kin|z|) , 


where {@,{ = 1. In order to select the terms a,z* which are largest in modulus, 
i.e., to find 
max (I, + kIn|z|) , (2) 
k 


it is convenient to let « = In|z{| and employ a method similar to that of the 
Newton open polygon. We create a correspondence between each term a,z* and 
a point k = k,! = 1, = In|a,| in the k, | plane. Then the maximum (2) is attained 
on the boundary of the convex hull of all these points. This boundary is an open 
polygon like the Newton open polygon; it was introduced by Hadamard [1896] 
to investigate the growth of entire functions (1) with m = oo. 

For example, if the number z is a root of the equation f = 0, then the number 
of leading terms a,z* must be greater than one (assuming that the sum of the 
remaining terms is small). That is, the maximum (2) is attained at several points 
(k,1,) at once. This is possible only when these points lie on one edge of the 
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Hadamard open polygon. The slope of this edge determines (approximately) the 
value of « = In|z|. See Valiron [1954] as well. 


2.2. The Number of Solutions of a System of Algebraic Equations 


Let a system of algebraic equations in n complex unknowns, X = (x,,...,X,)s 
be given: 
f(X)=0, i=1,...,n (3) 


Let D; = D(f;) be the support of the polynomial f;, We might ask whether we 
can determine the number of solutions of system (3) from the sets D;. Bernshtein 
[1975] found the answer to this question: the number of solutions does not exceed 
the Minkowski mixed volume of the polyhedrons I(f,) (see Busemann, 1958), 
and is equal to that volume in the general case. From this important theorem 
follow Kushnirenko’s results [Kushnirenko, 1975a, 1975b, Kouchnirenko, 1976] 
on the number of roots and the multiplicity of critical points of an analytic 
function (see also Bernshtein, Kushnirenko, and Khovanskii, 1976, Kushnirenko, 
1976, Khovanskii, 1978, 1983). 

Varchenko [1976] employed the Newton polyhedron to calculate the asymp- 
totics of the integral of a rapidly oscillating function. 


2.3. The Separation of Branches of an Algebraic Curve 


Let the point X = 0 be a singular point of the algebraic curve ¥ which is 
defined by the system of equations 


f(Xy=0, tatsn-1. (4) 


That is, f; = 0 and 6f,/0x; = 0 at X = 0. We can find all of the branches of ¥ 
which pass through the origin in the following way. For eachi = 1,...,n — | we 


construct a Newton polyhedron J; = I'(f;) and accompanying objects FEAL. 


U\?, and UM(E). We then seek the branches of ¥ in each of the intersections 
Me) OO Use Ae (5) 
After a suitable power transformation 


eae alan (6) 


with a unimodular matrix « = (a;;), we obtain in the image of set (5) the trunca- 
tions of the polynomials f; in the form 


fi = yEh(y, Te Yoma dl i=l,...,n—1. 
System (4), for y,, # 0, is equivalent to the system 


A HN. Yea) t=O, Hdd. (7) 
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The desired solutions of this system lie near solutions of the system 
h(yys-- nr =0O, i=lj...,n—-b, y,=0. (8) 


If a solution Y = Y° of system (8) is a simple point of system (7), then the 
corresponding branch of the curve ¥ can be found by the implicit function 
theorem. If Y = Y° is a singular point of system (7), then the translation Z = 
Y — Y° will place it at the origin and the procedure of resolving singularities can 
proceed. After a finite number of steps, as a rule, it is possible to isolate all of the 
branches of the curve ¥ (see Bruno, 1972e, 1973a, Soleev, 1983). 

For n > 2, the determination of the faces of the polyhedron /(f) and the 
investigation of their relative disposition is not a simple problem. If the set D(/) 
consists of a finite number of points, this can be accomplished by testing all 
possible combinations. A preprint by the author [Bruno, 1973b] presents a less 
difficult approach. See also Soleev [1983]. If n = 3, it is possible to use graphic 
methods. 

Bugaev [1888] proposed finding the branches of a curve by testing all 
possibilities; he called it “the principle of greatest and least power exponents”. 
Sintsov [1898] proposed a geometric method for n = 3, which can be explained 
as follows. We consider the normal cones U{? of the polyhedrons J; in the space 
R35, dual to R}. Then we consider the cross-section of these cones by some 
hyperplane (say, p, = — 1). When n = 3, this hyperplane is two-dimensional, and 
the full picture of this cross-section can be represented graphically with straight 
lines of various colors in a plane. These lines are drawn such that each function 
f; has its particular color of lines. Then it is possible to find all the intersections 
of the cones U{? and the corresponding truncated functions. 

Analogous methods have been applied to the determination of solutions of 
linear differential equations (see Sintsov, 1898, Chebotar’ev, 1943, Valiron, 1954). 

Note that we could solve system (4) by eliminating n-2 coordinates, writing 
the system as a single equation 


g(X1,X2) =0 > 


and solving with Newton’s method. But the process of eliminating variables leads 
to a great increase in the order of the polynomials involved (see Vainberg 
and Trenogin, 1969). Hence, the methods discussed above require much less 
computation. 

Botashev [1976] suggests a compromise method. Instead of using the 
Newton polyhedron, he creates a correspondence between each monomial 
Fox?! ...xdey'xf" and a point p = q; +°** + Gn-1. 9 =, in the p, g plane. He 
then constructs a Newton polygon for sets of such points, finds a first approxi- 
mation to the function, and so forth. The calculations of this method are thus 
simpler than those of the method of elimination, but are more complicated than 
the computations of the Newton polyhedron method. 
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2.4. Linear Partial Differential Equations 


Let 
L0)= 2 Ser? (9) 
Q@eD 
: é é \ 
be some polynomial, let Z = (foe be the differential operator, and 
Ox, OX, 


consider the linear differential equation for a function u(X): 
f(Du=0. (10) 


In recent years, much work has been done to investigate the properties of 
solutions of equation (10) using the properties of the Newton polyhedron I(f) 
of the symbolic polynomial (9) (see Mikhailov, 1963, Friberg, 1967, Volevich and 
Gindikin, 1968, 1985, Volevich, 1974, and Gindikin, 1974). 

More general equations are also considered. For example, equations with 
variable coefficients have been examined, and classes of operators with the 
necessary properties have been distinguished in terms of the Newton polyhedron. 


Chapter V 
Applications of the Normal Form in Mechanics 


§1. On the Motion of a Gyroscope in a Cardan Suspension 


This part of the chapter is an illustration of the application of the results of 
§1, Chapter IT and §§1, 2, Chapter III. We will calculate the effect of small 
nutational oscillations on the rate of precession of a gyroscope by reducing the 
equation of motion to the normal form. We will then show that in the given case 
the normalizing transformation converges. We will make calculations for a heavy 
gyroscope with respect to squares and cubes of the amplitude of the nutational 
oscillations. We will also consider a non-static gyroscope. The mathematical 
apparatus is presented in sections 1.1-1.4, and its applications to mechanical 
problems occupy sections 1.5-1.10. These results appeared in an earlier work 
(Bruno, 1972d). Some of the results were repeated by Zhuravlev [1976]. [Trans- 
lators note: a Cardan suspension is the same as a suspension in gimbals. ] 


1.1. Reduction to a Normal Form 
We consider a system 
X; = O(X) , 00) =90 , i=GARIS4 (1) 
which is analytic at the origin X = 0. Let 4,, ..., 4, be the eigenvalues of the 


matrix (0@;/0x;)y. We expand the functions @; as power series, and we will seek 
an invertible change of coordinates 


x, = C(Y) , €(0)=0, i=1,....n, (2) 


where the ¢; are power series, such that in the Y coordinates system (1) will take 
on the simplest form 


W=W(Y), i=l... (3) 


This is a so-called normal form, in which the series ; contain only resonant terms 
cyP!... yPn for which 


A, = Pidr + p24, + + Padn - 


According to theorem 1,§ 1, Ch. IH], for every system (1) there exists a normalizing 
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transformation (2), where the €; are formal power series. As a rule, this trans- 
formation diverges (Bruno, 1971, 1972a). 


Theorem 1. If, in the analytic system 


X; = P(X, X2) > i 1, 2,3 > 


(0, 0) = (0, 0) =0 , Ay a —A, 


(4) 


the subsystem of the first two equations has an analytic integral, then there exists 
a convergent normalizing transformation 


x, = E(¥1,Y2) ’ €(0,0) = 0 > i= 1,2 > 


(5) 
X3 = y3 + O(V1,2) 5 (0,0) =0 , 
which takes system (4) into the normal form 
W=AMty py 9 V1 V2) = yA, +9). 
2 = —y2lA, +9) » (6) 


i 


Vs= VY A(iy2* = f. 
k=0 

Proof. Since the first two equations form an independent subsystem, we begin 
by finding the transformations (5.1) and (5.2) which put this subsystem in normal 
form. The proof that this normal form has the form (6.1), (6.2), and that the 
normalizing transformation converges, can be found in §§ 14, 15 of Siegel’s book 
[Siegel, 1956] (see also § 1, Chapter IT); the existence of the integral is essential to 
that proof. Having applied the transformation (5.1), (5.2) to system (4), we obtain 


X3 = (Vis V2) = P3(S15 02) (7) 


and we will seek a coordinate change (5.3) which transforms (7) into (6.3). 
Differentiating formula (5.3) with respect to ¢ and replacing x3, ,, y2, and jy, 
with their expressions from formulae (6) and (7), we obtain an equation for series 
in y, and y,: 
ar \ 
ip R — 5,22) +49) . (8) 
\Oy, dy2 


We decompose the series €(y,, y2) into an infinite sum 
C=L Oy, y2) » 
s 


where the ¢°) are series containing only those terms cy{ yf? from the expansion 
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of € such that s = p, — pz. We similarly obtain 4 = ),y. Then 


ay ar 

arts) oc!) Soa na 
TO 2 ee a a 
oy, Oy2 


Since the only terms cy?! y$2 which appear in the series f are those for which 
S = P, — Pz = O, then (8) yields 


n® = f(yiy2) , 


1% =(A, + g)sO, s#0. 


(9) 


This in turn leads to 
yn 
ys) _ 


s(A, +9) ’ 
(s) 


1 n 
rere sa 


It is evident that the series € converges, since the series g and 4 converge and 
|s| > 1. Formula (9) shows that the series f is simply the sum of the terms c(y, y2)* 
from the series expansion of 4. The theorem is proved. [] 
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Now suppose that the original system (4) is real (that is, the g, are real-valued 
for real values of the variables), but that the eigenvalues A; are pure imaginary. 
According to section 1.9, chapter II and section 1.9, chapter III, for real values 
of the X variables the variables y, and y, will be complex conjugates and y; will 
be real. The solution of system (6) is 


Yi ¥2 =h=const , 


yy =ylexp(A,t + g(h)t) , 


(11) 
Yo = yoexp(—A,t — g(h)t) , 
Ys =y$+ flr) , 
where 
Sih) = ¥ fht. (12) 
K=0 


For real t, Re y, and Im y, perform periodic oscillations about the origin, while 
y3(t) grows from an initial value with speed f(h) (see (12)). Returning to our 
original coordinates with our analytic change of coordinates (5), we note that x, 
and x, perform periodic oscillations about the origin, while the variable x, 
oscillates and diverges from the origin with a mean speed (12), which depends 
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on the initial conditions x? and x? through the value of the integral h. To find 
f(h), we must calculate the coefficients f, and express h in terms of x, and x,. In 
section 1.4, we will calculate the first coefficient f, and express the integral h in 
terms of x, and x, to accuracy of order O{(x? + x3)*}. 

According to (9), fo = 93(0,0), and the solution with initial conditions x° = 
x9 = 0 has the form 


XpHxzp=O0, xy ext for. 
If (x9)? + (x$)? ¥# 0, then 
1 =B,0), x2 = Bd) , 


x3 =x$t f(h)t + B3(t) , 


(13) 


where the f; are periodic functions. Thus, the value f,h + f,h? + °° gives the 
deviation of the mean speed of motion of the oscillating coordinate x, from the 
value fo, which is the speed of motion in the absence of oscillations. 

Thus, in agreement with theorem 1, the following holds: 


Theorem 2. Under the conditions of theorem 1 and with A, pure imaginary, 
there exists some ¢ > 0 such that every solution of system (4), with initial conditions 
such that |x,°|? + |x,°|? < , has the form of (13). 


Remark. It is possible to give estimates for ¢ and for f (as a function of x?, 
x$) by means of a careful analysis of the convergence proof in Bruno [1971, 1972a, 
§5] or in Siegel [1956, § 15]. 


1.3. Stability with Respect to x, and x, 


In what follows, we will treat a particular case of system (4): 


X,=X2, 
X_ = L(x,) + M(x,)x3 , (14) 
x3 = N(x,), 


where, for sufficiently small |x,|, the functions L, M, and N have convergent 
power-series expansions: 


L=) Lyxt, M= YY M,xt , N=) Nx}. (15) 
KA 


Clearly, the system of the first two equations of (14) is equivalent to a second- 
order equation 
X, = L(x,) + X2M(x,) . 


Since Ly = 0, the point x, = x, = Ois a rest point for this subsystem of (14). 
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Theorem 3. In system (14), let L, = 0 fork <j and L; #0 for some j,0 <j < co. 
The condition 
j-—odd , L,;<0 (16) 


J 


is necessary and sufficient for the Lyapunov stability of the solution x, = x, = 0 
with respect to the variables x, and x4. 


Proof. System (14) has the analytic integral 


Kx3 — i 2L(u)K(u)du = const , (17) 
0 


in which 
Se) ss piss MS a ses 
K = exp | 2M(u)du = exp — “We ey. 2 1+ 

By the condition of the theorem 

| 2L(u)K(u) du = ——— Lx{** + 

0 i+ 
Hence, condition (16) is necessary and sufficient to fix the sign of the integral (17) 
in some neighborhood of the origin. Further, if integral (17) has fixed sign, then 
the solution x, = x, = 0 is stable by Dirichlet’s theorem (see Siegel, 1956, § 27). 
If the integral changes sign, then its zeros correspond to those integral curves of 
the first two equations of (14) which pass through the point x, = x, = 0. Since 
this is an isolated rest point if L # 0, and the subsystem of (14) is conservative, 
then on these integral curves lie both solutions which tend to the point x, =x,=0 
and solutions which leave any arbitrarily small neighborhood of that point. That 
is, in this case the trivial solution x, = x, = Ois not stable in the Lyapunov sense. 
The proof is complete. 0 


For example, the condition 
L, <0 (18) 


is sufficient for stability, while the condition 
140 (19) 


is necessary. 


1.4. Calculating the First Terms in f(h) 


We consider system (14) under condition (18). Since 


Fe in 


and we have the integral (17), this system satisfies the conditions of theorems 1 
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and 2. To find the coefficients f, in (6) and (12), we must calculate the coefficients 
of transformation (5.1), (5.2), then the coefficients of the series in (7) and, finally, 
use equation (9). Here, in order to calculate the coefficient f, we can limit 
ourselves to considering terms of degree up to (and including) 2k in all our 
expansions. Let us do this for k = 1. We thus write out all the terms through 
degree 2 in system (14), (15): 


X, =X, 
Xp = Lyx, + L,x? + Myx? +°° (20) 
X3 = Not Nix, + Nyx? +°°° 
Here L, < 0. We let 1 = +,/—L, (so that 4, = iA = —A,) and make the linear 
transformation 
2=EXJd, 22 =x2//2. (21) 


Then equations (20.1), (20.2) become 


2 A25" 5 
(22) 
Zp = —Az, tvzitpzzte, 
where 
v=L,A73?,, p= Mp/A. (23) 


In order to put the matrix of the linear part of system (22) into diagonal form, 
we now move into complex-conjugate coordinates 


w=2,+iz2, W=2,—iz,. (24) 
Then system (22) becomes 


it 


iv i 

w= —idw + aw + wy? — aw — wy ter 
(25) 

? iv i 

w= iw — (w+ wy? + Hw — wet 

4 4 
Next we perform a non-linear change of coordinates 
w=v+a,,07 + 2a,,00 + a0 , 

(26) 


W=0 + 4,107 + 24,,00 + G,2v" , 


which puts the quadratic terms in (25) into normal form; that is, it completely 
removes any quadratic terms, which do not appear in the normal form (6.1), (6.2). 
Thus, with accuracy to third-degree terms, 


b= —idv , v=ido . (27) 
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Differentiating equation (26) with respect to t, we obtain 


W = 6 + 2a,,v00 + 2a,,00 + 2a,,00 + 2a,,00 


and the conjugate expression for w. We then replace all the derivatives in this 
equation with their expressions from (25) and (27) and express w in terms of v 
according to formula (26). We thus obtain an equation which relates two quadratic 
forms of v and v. Equating the coefficients of like terms, we obtain linear 
equations in the unknown a,;. Solving these yields 


aya, 8 412= >, > @22 = 77: (28) 


Expressing x, in terms of v and 0 according to (21), (23), and (26), we find that 


xy {vo +0 + (ay, + ,,)0? + 2(ay2 + G,2)0d + (ay, + G,)07} . 


1 
2h 
If we substitute this into the series N(x,) = No + Nix, + Nox? +--+: then the 
coefficient of vt is 
Gy, + Gyo 1 


a BN ee 


2A 
As we can see from (9), this is just f,. Thus, using (28), (23) and the equation 
L, = —23, we obtain 


N, 


24f, = Nz + N,My —N,L2/L, . (29) 
Further, inverting transformation (26) 
v= W— a,,w? — 2a,,wW — a,,W? 4 °°: 
and employing formulae (24), (21), (28), and (23), we obtain 


1 _ 4Mod? + 2L2 . 2M 
A? 3A ; A 


vd = Ax? + x,x3 + Of{(x? + x3)?} . (30) 

Since h = vv + O(|v|*), then the integral h is equal to the right-hand side of (30). 

Let 4x, be the amplitude of the oscillations of x,, that is 4x, = max|x,|. By 

virtue of the symmetry of the equations this maximum is attained at x, = 0. 
We have thus proved 


Theorem 4. Under condition (18), the solutions of system (14) have the form of 
(13), and 
f(h) = No + 5,(4x,)° + 63(4x,)° + Of{(4x1)*} , (31) 
where 
dy = 3(N2 + NyMy — N,L2/L,) , 
(32) 
63 = 302(L2/Ly —2M)) . 
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Finally, given the system 


“=X, 
Xx, = I(u) + mx} , (33) 
X3 = n(u) 


with J, m, and n analytic functions and /(uo) = 0, then the point u = ug, x, = 0 
is a rest point for the subsystem (33.1), (33.2). If we let x, = u — up, then system 
(33) takes the form of (20), where 


Ly, =I"(uo) , L, =4I"(uo) My = m(uo) , 
No = n(uo) » N, =n'(uo) , N, = 3n"(uo) . 
Formulas (32) take the form 


5 = 4(n" + 2n'm —n'l"[I')= 4n'(n"/n' + 2m =I") = 4nd 
(34) 
53 = $4,(31"/l' — 2m) , 


where all terms are evaluated at the point u = ug. 


1.5. Statement of the Problem of a Gyroscope in a Cardan Suspension 


We consider here the motion of a heavy, symmetric gyroscope in a Cardan 
suspension, taking into account the mass of the rings of the suspension for the 
vertical axis of the external ring. We take our notation and equations of motion 
from a paper by Chetaev [1958]. Let us briefly summarize: y is the angle of 
rotation of the (external) ring (the angle of precession); @ is the angle of tilt of the 
frame (the angle of nutation); x, y, and z are the axes of the frame; ¢ is the angle 
of rotation of the gyroscope in the frame; J is the moment of inertia of the external 
ring with respect to the vertical axis; A°, B°, and C° are the moments of inertia 
of the frame with respect to the x, y, and z axes; A, A, and C are the moments of 
inertia of a symmetric gyroscope with respect to the x, y, and z axes (fig. 85). If 
friction is absent in the bearings, the only active forces are gravitational, and the 
center of mass of the frame and the gyroscope lies on the z axis at a distance ¢ 
from the fixed point of the gyroscope, then the equations of motion are 


(A + A°)6 — W2(A — C + B° — C°)cosOsind + Cowsin# = Plsiné , 


4 yg + (A + B°)sin? @ + C°cos? 0) + Ccos 6(@ + Weos#)} =0, (35) 


at 
a 
—(@+ Wcosh)=0. 
at 


Here P is the weight of the combined gyroscope and frame. These equations 
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Fig. 85 


allow three first integrals of motion, of which we will need only two: 


@+Wcosd=r, 
, . (36) 
w{J + (A + B°)sin? 6 + C°cos? 6} + CcosO(@ + woos) =k . 


Here r and k are suitable constants. Solving equations (36) for @ and W and 
substituting the results into (35.1), we obtain the system 


» b2 = — 2 
p= -7 J 042 tay © aa jsine 


e you (y — u?)? 
» ba-u 
y=- ak (37) 
ey—u 
a—uU 
@=r—u- 
ey—u 


We have here introduced the notation: 


5 — PUA + B°- C%) =k 
Cr? Ser 
Cr A+ B°- © 
haa Cy ea a = 
J+A-+ B° 
= ———__ u=cos@ 


ACE BO 
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We place the following limits on the parameters: 


J, A°, B°,C°,A,C, P20, A+A°>0, 


J+A+B°>0, J+C°>0, Cr#0. 
From these inequalities and the relationships (38), we obtain 


b#0, y<Oory>1, ye>O, lal< ao. 


The right-hand side of the first equation of (37) vanishes at stationary values 
@= 9. To simplify our calculations, we will consider separately the case 
when uy = cos 6 makes the expression in braces in (37) vanish and the case 
when sin 0) = 0. A stationary value 9) corresponds to regular precession of the 
gyroscope. 


1.6. The General Case |uy| < 1 
: “ uti? ee 
Since for |u| < 1 we have —@sin? =u + , >, the first equation in (37) 
can be written a, 


la — 42). (39) 


If we write x, =, then the system of two equations (39) and (37.2) will be 
equivalent to system (33), with 


2 a ry 
Ku) = 2 | a+ = 3 rs hu — u?) , 


u ba-—u 
me Reape aes 


(40) 


For ease of calculation, we introduce auxiliary functions 


a—u 
Gu) =—— , 
you 


F(u) = -o+G-—uG’, 
(41) 
P,(u) =u? — 2ou + y , 
P3(u) = (a — u(y — u*) + PZ 
= —2au? + 3(a? + yu? — Gayu + oy +7? 


From (41) we obtain 


1.6. The General Case |uy| < 1 245 


G= 1—2uG _ P, 
you (y- wy? 5 
era, see (l= 2G? PSG P; 


you (y — u?)P, (y— uw?) 


Now let u=uy be a root of the equation [(u) =0, with |ug| <1. Since 
[= b*e '(1 — u?)F, then ug must be a root of the equation F(u) = 0, or 


Qu) = o(y — uw’)? + (2 — ul(au — yy =0. (43) 
Thus, when u = uo, we have 
l=bel(1—u7)F’ , 
"= _Ab2e7yF’ 25-171 _ ,2\p" 
l b uF’ + bee '(1—u*)F" , (44) 


4 
Vy —— u 


zt F'/F . 
Theorem 6. If |uo| < 1, then the solution u = ug, u = 0 of system (37) is stable 
with respect to the variables 0, 6, and w if and only if one of the two conditions 
A+ B°—C°3>0 (45) 
or 
A+ B°—C° <Oand P,(uj) >0. (46) 
is satisfied. In the stable case, the mean rate of change of the angle w is 
[Ye] = n(uo) + 5,(4u)? + 53(4u)? + O{(4u)*} , (47) 


where 6, and 0, are given by formulas (54). 


Proof. By theorem 3, we have stability with respect to the variables mentioned 
when 


(up) <0, (48) 
and instability if either 
I'(upg) > 0 (49) 
or 
I'(ug) = O and I’(ug) #0 . (50) 


Let A + B° — C° > 0; then e > 0,y > 1, and F’ < 0, as we can see from the first 
expression for F’ in (42). The equation F’ = 0 is attained only when uy = « and 
y = a’, which is impossible when y > 1 > |u,|. Since 1 — u? > 0, then (44.1) 
implies that (48) is satisfied. Stability in this case was proved by Rumyantsev 
[1958]. 
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The case A + B° — C° = 0 will be treated in sections 7, 8. 
Now let A + B° — C° < 0; then e < 0, y < 0, and, by (42) and (44), 


I= —b?e7(1 — u?)P3(y — u?)3 


Therefore (48) is satisfied for P; > 0, while if P; < 0, the instability condition (49) 
is fulfilled. Finally, if P,; = 0, then /'(ug) = 0; however, as we shall now show, 
I"(uo) # 0 and, by (50), we will have instability. In fact, by (44) 


I" = b?e (1 — u?)F” = —b?e (1 — u?)Py(y — u?) >. 
It remains to be shown that the system of three equations 
Qo=0, P,=0, P,=0 


does not have acommon solution. Since P; = 6(a — u)(au — y), then the equation 
P, = 0 has two roots, uy = a and ug = y/a. By (41) 


P(a)=(y- a? > 0, 


P3(y/a) = y3/u? + ay — 2y? <0, 
since y < 0 < a?. 
Now let one of the stability conditions (45) or (46) be satisfied; then the 


condition (18) is satisfied and, by theorem 4, we can calculate the coefficients 6, 
and 63 in (47). By (42), (44), and (41), with u = up, we have 


2 
b=n"/n' + 2m— 1" =(nG’y — i “3 
4u 2u P,; ’ 
> _ dj 
eager ee jn} 
(51) 
2u 2u Pi, PS 
2 z+ a 
l—u yru P, P3 
2u(y — 1) Py Py S 
~ (L=w)y—u?) PP, Ps (L—u?)(y —u?)P,P3 ” 
where 
Py=—-2a-u), Py=6(a—ujl(au—y), 
S = 2(y — uP, P; + (1 — u*)(y — u?) (PP; — P3P2) (52) 


= 2(y — 1)uP,P; — 2(1 — u?)(y — u?) (au? — 307u? + 3ayu + 0? y — 2y?) . 


Similarly, we find 
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eee ee 
ety)?’ 
(53) 
ooeer To) he P; 
Onna! /l Sans nF) =e © OP: : 
According to (34), f(A) is expressed by formula (31), where 
5 15 1b S 
Eos Hie S Ss 
4e(y — uw )(1 — u?)P; 
(54) 


The theorem is proved. 1] 


The coefficient f; = 0, i.¢., f(h) = O{(4u)*}, only if 5 = 0, ie. S = 0. The two 
algebraic equations Q = 0 and S = O relate uy and the three parameters ¢, a, and 
y. If these four variables satisfy both equations, then a solution of system (37) 
with initial conditions 


u° = uy , uw=0, py =n(u’) , ~° =r—u°n(u’) 


corresponds to the regular precession of the gyroscope. With initial conditions 
close to these, the gyroscope will undergo pseudo-regular precession. Moreover, 
the speed of this pseudo-regular precession will be fixed to an accuracy of 
O{(4u)*}. It might be interesting to investigate these dependencies in greater 
detail. 


Remark. Since system (37) is integrable, then its solutions could be studied by 
integrating, as was done in a paper by Lunts and Smolitsky [1966] and a book 
by Lunts [1976]. As Smolitsky noted, the results of these works yield condition 
(45), with the strict inequality, and the first two terms in expression (47). 


1.7. The General Case |uy| = 1 


Here 4) = 0 or 2, up = cos 6) = +1, and we go from system (37) to system 
(33), choosing 
x, =0-6, xX,=6, x,=W. 
Then 
(0) = —b?e F(cos@)sin@ , m=0, 
(55) 
Soe b a —cosé 
~ ey—cos?6 | 
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Theorem 7. If |u| = 1, then the solution 6 = 6p, 8 = 0 of system (37) is stable 
with respect to the variables 6, @, and w if and only if one of the following three 
conditions is satisfied: 


1) —e7} F(up)uy < 0 ; 
2) F(ug) = 0, e7' F'(ug) < 0 ; (56) 
3) F(uo) = 0, F’(up) = 0, —e ! F"(ug)uy <0. 

When the first of these conditions is fulfilled, the mean speed of precession is 


— Cruo 


. k 
W="; (57) 


4, U+24 — 2B + C°)Cr — 2K(A + B? - Cu 


0 4 
U+o) Up(40)? + Of(40)*} . 


Proof. Since the expansion of the function / in powers of x, = 6 — 8 contains 
only odd powers of x,, then, in accordance with theorem 3, any one of these 
three conditions is sufficient for stability: 


di/d0<0, 
di/d8=0 , d31/de? <0, (58) 
dl/d0 = d31/d0? =0 , d°1/dee <0 , 
where the derivatives are evaluated at 6 = 0). At the same time, any of the 
following three conditions is sufficient to establish instability: 
dl/d0 >0 ; 
di/d0=0, @i/dee>o, (59) 


dl/d0 = d*1/d0° =0 , d°i/d0° >0. 
By (55) 
di/d0 = —b?e"F(ug)uo , 


d31/d0? = b?e{F(uy)up + 3F'(ug)ua} , (60) 
d51/d0> = —b?e'{ F(ug)up + 1SF’(up)ua + 1SF"(up)ud} 


Therefore conditions (56) coincide with conditions (58), and, consequently, 
they are sufficient to show stability. The case F(uo) = F’(ug) = F"(ug) = 0 is 
impossible, as shown in the proof of the preceding theorem. Hence, if none of the 
conditions (56) is fulfilled, then one of the conditions (59) must be satisfied, 
implying instability. 

In what follows, we will assume that the first stability condition (56) is 
satisfied, and then apply theorem 4. Since, when 6 = 6), we have 
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a7l d?n _ b(1 — 2attg + yuo 
deez” d@? e(y — 1)? : 
then by (31) and (34) 


1d? 
fh) = n(%) +55, 


4 az 19)" + 0{(46)*} , 


which is equivalent to formula (57). The proof is complete. 1 


Remark. [n contrast to theorem 6, theorem 7 tells us how to calculate the 
precession rate in only one of the cases of stability. This is related to the fact that 
in cases (56.2) and (56.3), 4; = A, =0 and theorem 4 is not applicable. It is, 
however, still possible to find the precession mode. To do so, we must consider 
system (14), (15) with M = 0, where N is an even, and L an odd, function of x,, 
and, in addition, either L, = 0, L, <Oor L, = L, = 0, L, < 0. It is convenient 
to use the functions C,(0) and S,(@) introduced by Lyapunov [1935b] (see also 
§4, Chapter ID. 

Condition (56.1) is equivalent, when uy = 1, to condition (2.8) in Rumyantsev 
[1958]; it is shown there that that condition is sufficient, and the condition 
—e'F(ug)uo < 0 is necessary, for the stability of that solution with respect to 
6, 0, and w. The case F(u,) = 0 was not treated there. 

It is not hard to derive from (42) the fact that condition (56.2) is equivalent 
to the condition Q(u,) = 0 and either A + B° — C° > 0, or A+ B°—C° <0 
with P;(u 9) > 0. Condition (56.3) is equivalent to the condition 


Olu)=0, A+B°—C2<0, Py) =0, Cup <0. 


As is evident from (57), the speed of revolution of the external ring is, to an 
accuracy of O{(40)*}, independent of the amplitude of the nutational oscillation 
AO, if 

k J+2A+2B°—-C? 
2uo- = a 7 
r A+ B°-C 


1.8. The Case A + B°— C°=0 
Here, in accordance with (38) 


_ P&(A + A®) _J+AtB 


=0, e? ; = eee 
: OCF oS Ade 


(61) 
F(u) es ge 


e e ey 


If we restrict ourselves to cases where |uo| < 1, then the equation for ug is 
e”' F(ug) = 0, whence 
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k PUI + A+B) 
Cr C?r? , 


Ug =a-—oy= 


(62) 


Since, by (61), 
e'F (u) = -(eyy' <0, 


we will always have stability here. Further, in formulas (51) and (53) we have 
rational functions of y. Letting y approach infinity and taking into account the 
limiting equalities (61), we find, in the limit, that 


b= 5, w= —beyy', 56,=0. 


From these, using (54), we obtain 6, = 0 and 


buy 5 Cruo 
2ey(1—ud) =. A + A + B°)(L — u2) © 


1 
6, =gn'd = —_— 


Since (37.2) has, in the given case, the form 


; k — Cru 
Y= TEA Bo 
we have 
: Cr 
Ay= ee ee : 
and formula (47) takes the form 
ps Crug(4u)? 4 
[v] = n(uo) 374A + BA ae) + O{(4u)*} 
J+A+B° , . 
= ng) — ZEA + Fo ayy? + f(y} . 


2Cr(1 — ud) 


If we disregard an estimate of the remainder terms O{...} in this last expres- 
sion, then we obtain formula (6.6) from a paper by Klimov and Stepanenko 
[1967] which treats the case A + B° — C® = 0. In particular, 5, = 0 only if 
Up = 0, that is, according to (62) if Crk = P{(J + A+ B®). 


1.9. The Balanced Gyroscope 


For a balanced gyroscope, € = 0, so that o = 0. Equation (43) takes the form 
(a — u)(au — y) = 0 and has two roots: 


u=U=4 (63) 
and 
uU=Uy = y/a . (64) 
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First we consider case (63). Since n(a«) = 0, this corresponds to a rotation of the 
gyroscope with fixed axis. As we see from (44) and (42), this always means I’ < 0. 
By (52), P; = P, = 0 and formulas (51) and (53) give us 


=~ 2u(y — 1) _ 3u 
~ (1 u?)(y — uw)’ tye” 
Since P,(a) = y — «?, then G'(a) = —(y — «?)"!, whence with formula (54) we 
obtain 
| AE b a(y — 1) 
05 = '6= 
ae 2e (1 — 0 )(y — a2)? 


(J + C°)Crcos 05 
2{(J + A + B°)sin? 0) + (J + C°) cos? A}? sin? A ° 


an b a(y — 1 
0,02, = — > —_ 
e(1 — «*)(y — a*) 


6. = 
> 3 


Since 4u + —sin0)4@ then our expression for 5,(4u)? agrees to within 
O{(40)} with formula (20) in Magnus [1955], where the rate of drift of a 
gyroscope axis was calculated for the first time, but with a completely different 
method and with no proof of convergence. This was followed by numerous works 
in which the rate of the drift was found to degree O{(40)*} (see Appendix II of 
the book by Nikolai, 1964). 

Smolitsky [1966] calculated the coefficients s, and s, in the expansion 


[i = a sx 


where the parameter x equals y(4u)*{y — (w + 4u)?}~'. Here, x plays the same 
role as did the integral h in (11). If we rewrite this series as a series in powers 
of Au, then the formulas of Smolitsky agree with our formulae up to order 
O{(Au)*}; however, Smolitsky’s calculations give [y] to greater accuracy. 

We also mention here the paper of Kobrin and Martynenko [1971], which 
examines a rapidly rotating unbalanced gyroscope. There, the value [y] is 
calculated to within O(u3), where « = max[o, Au], for small ¢ and uy — «. An 
application of the asymptotic method in the paper is not necessary for the case 
as the result is obtained from Magnus’s formula with the following simple 
calculations. In the neighborhood of uy = x, o = 0, formula (47) yields 


[YW] = n'(a)(Uo — @) + 3n"(a)(up — &)? + b29(4u)? + O(n’) (65) 


Here 6.) = 6, at ¢ = 0, up = 4; that is, 6,9(4u)* is Magnus’s drift rate of the 
balanced gyroscope. In a neighborhood of the point ug = «, o = 0, the root of 
equation (43) can be expanded into a power series in a: 


Up — & = (a? — yo + 3a(a? — yo? + O(c7) . 
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Substituting this into (65) and considering only those terms of no more than 
second degree in 4u and o, we obtain formula (3.4) from the paper by Kobrin 
and Martynenko [1971]. 

We next consider case (64). Here we have a regular precession with rate 
n(y/a) # 0, which is of the same order of magnitude as the speed of the proper 
rotation of the gyroscope. According to (52), P; = 0, P,/P, = 2a/y, and formulas 
(51) and (53) give 


oe 2u(y — 1) Py 2a(a* — 207y + y*) 
(t—u?)(y—u?) Py ya? — y?)(a? — y) ? 
3u 3a : bo? 
ge wa + Ae 
you af —y e y(a* — 9) 
Therefore 
1, b a (a*t — 2a?y + y3) 
6, =—n'6= a 2 a 
4 2e y*(a* — y)*(a* — 9") 


Since |)/a| < 1 and y(y — 1) > 0, then we must have a* — 20?y + »3 > 0; hence 
6, never vanishes. 
Finally, the analysis of the case |u9| = 1 follows immediately from section 1.7. 


1.10. The Spherical Pendulum 


The spherical pendulum is a point of mass m which moves on a sphere of 
radius ¢ (see Sommerfeld, 1947, § 18). It can be considered as a special case of our 
Cardan gyroscope with the following values of the parameters: 


J=A°=B°=C°=0, A=f?m, P=mg; (66) 
Cr=0. (67) 


In order to apply theorem 6, we let the values of the parameters in 
(66) be given, but in place of (67), we take Cr =e and let ¢ approach zero. 
Then 


Since theorem 6 remains valid for y = 1, and A + B° — C® > 0, then the basic 
motion u = up is stable. Here uy is a root of the equation 


PCA(1 — u?)? + (k — eu)(ku — &) = 0 
or, in the limit as e + 0, 


PCA(L —u?)4+ k?u=0. 
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Further, formulae (40), (51), (53), and (54) yield, when u = uy as e > 0, 
lk—-—eu ik 1 


a en a 
Fert aS, 
P,P, u 
F g P, ku 
n=>- _- . 
A(Ql—-wy A(l—u’?)? 
5 2 3u P; 3u 1 
2 SPS Oy? 
1 k l 
iments 
a 2A (1 — uw???’ 
2 k 2u? +1 
Kez dgh oo =— 
wo gree A (P= 4) 


Note that for k # 0, 6, and 6; never vanish. Thus, by theorem 6, 


ko k (4u? ok Qu? +1 
~Al—w 2A =, A(l— ap ay TAME (68) 


VI 


This formula can also be obtained directly from the equation of motion of a 
spherical pendulum (see Sommerfeld, 1947, § 18, formulae (18.8) and (18.9)). To 
do so, it is necessary to note that Sommerfeld’s notation is related to ours in the 
following manner: 


g=y, i Gan C=k/m. 


Then in our notation his formula (18.8) is 


. k 1 
Sep te 69 
¥ Al=w (62) 
Putting this into his formula (18.9), differentiating it, and recalling that 
2 
Pn uu 
CA ap ee | 
6sin u-+ i= 
we obtain a second equation 
uu? k\? ou g 
i 1 7y=0. 70 
os +(4) ard u?) (70) 


If we now apply the results of sections 1.1—-1.4 of this chapter, we will once again 
arrive at equation (68) (compare to Tsel’man, 1972). 


§2. On the Oscillations of a Satellite in the Plane of an 
Elliptical Orbit 


This part of the chapter illustrates the applications of the methods of §§ 3, 4 
in chapter III. We consider a nonlinear differential equation describing the 
planar motion of a satellite with respect to its center of mass, which moves in an 
elliptical orbit. There are two cases, corresponding to oscillations of the satellite 
in either the orbital or the absolute system of coordinates. In each case, we will 
use the calculations of the higher order approximations in the asymptotic method 
(see example 1, § 4, Chapter III) to refine results on the stability of known periodic 
solutions, as well as to find new classes of periodic solutions (see § 3, Chapter IID). 

This problem is interesting because in it we encounter strong degeneracy (to 
fifth order in a small parameter). The presentation follows that of an earlier work 
by the author (Bruno, 1976b). 


2.1. The Formulas of the Asymptotic Method 


We consider the system 


dX/dt = X = eF(X,1) , (1) 


where X = (x,,...,x,), F =(f,,...,f,) are functions analytic with respect to 
X and t and 27 periodic in t in some domain #. In accordance with the 
asymptotic method of Krylov and Bogolyuboyv, as presented by Bogolyubov and 
Mitropol’skii [1974], there exists a change of coordinates 


X=Y+ YD HY, , (2) 
k=1 
which transforms system (1) into an “averaged” system 
Y= > &G,(¥) = G(%o) . (3) 
k=1 


Here, the functions H,(Y,t) are analytic with respect to Y and t and 2z-periodic 
in t in the domain #. The functions G, are also analytic there, but do not depend 


2.1. The Formulas of the Asymptotic Method 


on t. The series in the right-hand sides of (2) and (3) are formal series (i.¢., they 
can be either convergent or divergent). System (3) is the “normal form” of system 
(1) (§4, Chapter III). From §§ 3, 4 of Chapter III, we know that the normalizing 
transformation (2) converges on the set of rest points of system (3) 


aA ={Y,e:G(Y,e)=0} , (4) 


and this set is analytic. The points Y, ¢ in this set correspond to 22-periodic 
solutions of the original system (1). Consequently, to find the 2z-periodic solu- 
tions of system (1) for small |e], we must calculate the terms in expression (3) 
which determine the structure of the set (4). To do this, it is not always sufficient 
to know the first term, G,. For example, if G, = 0, then it is necessary to find G, 
and, in those cases when G, has multiple roots, even G3. 

In what follows we will be dealing with very degenerate situations and we 
will have to calculate the terms of expansion (3) to fifth order. We therefore write 
out the equations which serve to define the functions H, and G, (k = 1,...,5) to 
fifth order as well. These equations are obtained by differentiating equation (2) 
with respect to t, replacing the derivatives X and Y according to expressions (1) 
and (3), and finally expressing X in terms of Y and t according to (2). We obtain 


F(Y,t) = G, + on, ; 
My, 
nt P Ean th Meg, 4 
Pela ie ato ey ee Mh, Ae 
Spl nt tas naa 
(5) 
wha, tho, «a, + Ma, 


Here, the partial derivatives with respect to Y and their vector products must 
be understood in the tensor sense. We replace X with Y as the argument of F. 
In what follows, we will treat a very special case of system (1): 


X, = 6X2, X2 = af (x,,0) , (6) 


which is equivalent to a second order equation. For this system, calculations 
according to formulas (5) yield the following values of G, = (91,924): 
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Jit =)1 > 912 = 913 =9is = 915 =0; 
gu =(f], 922 =9, 
Ne 
m=[ Ps] 
afe or 7 
G24 = —2Y2 [7 | > (7) 
i af of of* of oy \* of 
dnp" = 
= 3y2 pee ee el we 
se AS: af sn odie IG ay, ) | 
Ph 
[Pheer 
Here f = f()1,2), Lf] is its time average, and 
f*=f-fl. 


; = 1 
ft= {frat = ’ (a, sin kt — b, cos kt) , 


f* = ¥ (a,coskt + b,sinkt) . 
k=1 
Consequently, the “averaged” system for system (6), taken to an accuracy of €°, 
has the form 
Ji = Y2 » 


x, O are 0 
naafise| Z| etay,] 7 =| (8) 


&°{y3p(y1) + [flaii) +r0v)} 5 


nV oe af =. 
r= [) el alae) ©) 


We now consider an even more specialized case, when 


where 


f(y1,¢) = a(t)sin y, + B(t)cos y, > (10) 


where a is an even, and f an odd, function: 


a()=a(—1), B()= —B(—-2), (11) 
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and « and £ are both 2z-periodic in t. In this case we have 


(f]=[e]siny, , 


| pe | =scosy,siny, , 
ey, 


ve =0, r=r% cos? y,siny, +r? sin? y, , 


where 


s = [a&*] — [6] , 
——J ——] 


H = [a(ad*)*) — [a(68)"1 — C6C0B)) - (0a*H) — 310821, (12) 
r= Cp(pe)) — $L0d*)?) « 
Thus, in the case of (10) and (11), the “averaged” system (8) takes the form 
Yi = y2 5 
y. =ela]siny, + escosy, siny, (13) 


+ e°{y2p(yi) + Lf Ja(1) + 1) cos? y, sin y, + 7 sin? y,} 


2.2. The Problem of the Oscillations of a Satellite 


We consider the planar motion of a satellite about its center of inertia, which 
is in an elliptical orbit in a central gravitational field. Let the principal axis of 
inertia, with respect to which the moment of inertia is B, always be perpendicular 
to the plane of the orbit. We denote the moments of inertia with respect to the 
other principal axes by A and C. The equation of the relative motion was found 
by Beletskii [1959]. It has the form 


26 . dd : : 
(1 + ecos vy); — 2esinv— + ysind = 4esinv . (14) 
dv dv 


Here 6 is the angle between the radius vector of the center of mass and the 
principal axis of rotation with moment of inertia C; up = 3(A — C)/B; e is the 
eccentricity of the orbit; and v is the angular distance between the radius vector 
and the perigee of the orbit (the true anomaly). We know that |A — C| < B, so 
lul <3. 

This nonlinear second-order differential equation (14) has periodic coeffi- 
cients and two parameters e and y which lie in the rectangle & = {e,u:0<e <1, 
|#| < 3}. Equation (14) is invariant under the changes 
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vovtanu, 6-6, er-e, HOE; (15) 
voy, b->b4+n, e>e, us —p 5 (16) 
yor-y, 637-6, ee, won. (17) 


Our task is to study some of the periodic solutions of equation (14) for small 
|u|. To do this, we follow Chernousko [1963] and take as new independent 
variable the time f, reckoned from the perigee, multiplied by 22 and divided by 
the period of the revolution of the satellite (i.e., t is the mean anomaly): 


ares fire v e/1 —e?siny 


t sea 
Vite 2 1 + ecosv 


(18) 


t(v + 2x) =t(v) + 27 . 
Then 
dt = (1 — e?)?*(1 + ecosv) 2 dv . 


We now introduce new variables x, and x, according to the formulas of 
Chernousko [1963]: 


d6=mt—2v+x,, dé/dt =m —2+ &x, , (19) 


where m = const and ¢ = VJ |u| are new parameters. 
Then equation (14) is equivalent to the system 


X, =€X2, 


3 (20) 
Oe) sin(mt — 2v+ x,), 


X, = —esgn tease 
where v is considered a function of time and defined by equation (18). If m is an 
integer, then system (20) has the form (6), (10), (11), where 


(1 + ecosv)> 
(1 — e?)? 


a(t) = —sgnyu cos(mt — 2y) , 


(21) 


(1+ ecosv)? . 
(ep in(mt — 2v) . 


B() = —sgnyu 


Chernousko [1963] investigated the first approximation of the averaged system 
(13) corresponding to various integers m. Here, with the help of higher-order 
approximations, we consider two cases: m = 2 (oscillation in the orbital coordi- 
nate system) and m = 0 (oscillation in absolute coordinates). 

We note that after substitution (15), the right-hand side of the second equa- 
tion (20) is unchanged for even m but changes sign for odd m. In the averaged 
system (13), this substitution corresponds to the change 
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tot4t7, e>-e, YoY. 


Since the right-hand side of system (13) is independent of t, the right-hand side 
of the second equation (13) must be an odd function of e for odd m and even for 
even m. Moreover, for any integer m # 2, the right-hand side of the second 
equation in the complete averaged system (3) must vanish when e = 0; this 
follows from the integrability of system (20) when e = 0. 


2.3. Oscillations in the Orbital System of Coordinates 
We consider here the case m = 2. Here 
[x] = —sgny-o(e) #0. 


Chernousko [1963] drew the graph of the function g(e): on the interval 0 < e < 1, 
¢(e) is monotone decreasing and vanishes at e = eg = 0.682. 

For ¢ #0, the rest points of the first approximation of system (13) are 
determined by the system equations 


y2=0, fe)siny, =0. (22) 


The solutions of this system with » = 0 are the generating periodic solutions of 
system (20); that is, the limits as » — 0 of the periodic solutions of system (20). 
Thus, in accordance with (22), we obtain the following families of generating 
periodic solutions (x, = O everywhere): 


x,=0, e — arbitrary ; (23) 
X,=T7, e — arbitrary ; (24) 
x, — arbitrary , €=e. (25) 


We will denote families, (23), (24), and (25), of generating periodic solutions by 
Fy, F,, and G, respectively. For system (13), the families A and F, are 


Fy = {y, = 0, yz = 0;e,p — arbitrary} ; 
F, = {y, = 1, yy = 0;e,u — arbitrary} . 


It follows from property (16) that F(e, —u) = F,(e, w). Consequently, it is suffi- 
cient to investigate only one of the families A and F,. 
Now let us disregard terms of higher than third order in ¢ in system (13) and 
consider the system 
Vi =eh2, 
(26) 
yp. = —esgnye(e)siny, + e*s(e)cosy,siny, . 
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Then the fixed points of the family satisfy the equation 
ple) = us(e)cos ys, (27) 


where the function s(e) is found from formula (12). Since e = eg is a simple root 
of the equation ¢g(e) = 0, then near that point we can, by the implicit function 
theorem, solve equation (27) for e — é9: 


. S(€9) 
€ — 9 = {(ucos y,) = pcos yy ~~ + O(n?) . (28) 
g'(€o) 


That is, for every y, the family Y exists only for those values of e which lie between 
boundaries determined from the equation 


gle) = ts(e)p . (29) 
By (28), these boundaries are given approximately by the formula 


S(€9) 
y’(€o) 


: (30) 


e—eo= 4 


We will now investigate the stability of the rest points of the averaged system 
(26). The characteristic equation for these solutions is 


2? = —pe(e)cos y, + p’s(e)cos2y, . 1) 


If p(e°) 4 0, then near the point e = e°, » = 0 the stability of solutions of the 
family A, is determined by the sign of the value ye(e°), while the line p = 0 is 
the boundary of the region of stability in which pe(e®) > 0. 

For example, when g(e°) > 0, we will have stability for » > 0 and instability 
for u <0. 

One more boundary of the region of stability of the family ¥ passes through 
the point e = é€9, w = 0. By (31), this boundary is determined by the equation 


ple) = use) , (32) 


which is analogous to equation (27) with y, = 0. Solving this equation, we obtain 
in accordance with (28), 


5(€p) 


Y' (eo) 


At the same time, boundary (32) is the intersection of the families % and Y. 
Similarly, the boundary of the region of stability of the family F, is its intersection 
with the family @ and is obtained from equation (27) by letting cos y, = —1. 

In the family Y we have, according to (27) and (31), 


ee =C(w =H + O(n’) . (33) 


A? = p?s(e)(cos 2y, — cos? y,) = —p?s(e)sin? y,  —p?s(eq)sin? y, . 
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Fig. 86 


That is, for small | j¢| the entire family is stable if s(e¢9) > O and unstable if s(e,) < 0. 
The family Ap and its stability have been calculated for all values of e, up € & (see 
Ziatoustov et al. 1964, Sarychev and Zlatoustov, 1975). The published results 
are sufficient to show that s(e)) > 0 and that the family @ is stable. In fact, the 
boundary of the region of stability of the family Ap is given by equation (33). We 
see in figure 4 of Zlatoustov et. al. [1964] that s(e9)/@’(e9) < 0; from figure 3 of 
Chernousko [1963] we find that @’(e)) < 0; consequently, s(eg) > 0. 

Now let us trace the fate of the family 9 as p increases from zero. According 
to the calculations of Zlatoustov, et al. [1964] and Sarychev and Zlatoustov 
[1975], the boundaries % and Y¥, of the regions of stability of A) and F,, 
respectively, leave the point e = eg, z = 0 and then intersect again at a point M 
(figure 86). The shading shows the location of the regions of stability with respect 
to its boundaries. The periodic solutions of the families Fy and ¥, are symmetric 
in the sense that they transform into themselves under the substitution (17). In 
contrast, the solutions of family Y are asymmetric. 

Equation (14) is conservative and equivalent to a Hamiltonian system if we 
take the value of one of the parameters e or p as the Hamiltonian. Henon [1965] 
and Bruno [1972b] have considered families of symmetric periodic solutions of 
a Hamiltonian system with two degrees of freedom. It was shown that, in the 
general situation, the change of stability (the principal resonance) in a family can 
be of two types: 

I. On a critical solution, the Hamiltonian has an extreme and there is no 
bifurcation. 

II. The Hamiltonian does not have an extreme on a critical solution, and the 
family generates two families of asymmetric periodic solutions. These families 
exist only for values of the Hamiltonian which lie to one side of the critical value. 
If the symmetric family is stable for these values of the Hamiltonian, then the 
asymmetric families are unstable, and conversely: if the symmetric family is 
unstable for these values of the Hamiltonian, then the asymmetric families are 
stable. 
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The role of the Hamiltonian is played here by the parameter e. For fixed p, 
the families Fy and ¥, pass through points of the curves “ and Y,, respectively, 
and have no extreme values. The point M divides each of the bounding curves 
Ly and L, into two parts. Yj and Y;' are the parts which extend from the point 
€ = ey, p = 0, while Y and ¥? are the parts of the curves which do not pass 
through this point. We denote by Y the families of asymmetric periodic solutions 
which branch off of the families Fy and F, along the curves Y'. Evidently, these 
families distribute themselves as follows: The families Y} and Y; are defined on 
that region of the e, w plane which is bounded by the curves Y and ¥}. 
Together, these families constitute the family Y, and are stable throughout that 
region. Specifically, inside the region the stability index for the family Y is A < 1; 
but there may exist a subregion of a parametric resonance in which A < —1 and 
G is unstable. After the work of Bruno [1976b], where the family ¢ was first 
found, Sarychev, Sazonov and Zlatoustov [1980] computed the entire family 
for e < 0.998. They found that the family exists in the region bounded by the 
curves &! and #;}, as well outside of that region, and has a fold. 

We note that the family Y corresponds to new types of oscillations of the 
satellite. These are distinguished by the fact that their average values may take 
on any value between 0 and 2z, while the oscillations of families A and ¥, have 
the discrete mean values 6 = 0 and 6 = z. In this context, the oscillations of the 
family g are stable. 

The function ¢g(e) and even the family A have been found for other integers 
m besides m = 0 and m = 2 (see Beletskii, 1975, Beletskii and Lavrovsky, 1975, 
Sarychev, Sazonov, Zlatoustov, 1979). For such values of m, the equation g(e) = 
0 has one simple root in the interval 0 < e < 1; e = eg. As above, it appears that 
in these cases the family (25) generates a family Y of stable periodic solutions. It 
would be interesting to find the family Y in these cases and to trace its fate outside 
of the curves Y and ¥,. It would also be useful to compute the family Fp for 
|u| > 3, in order to discover other curves of bifurcation of other families of 
periodic solutions. It is possible that such families exist for |u| < 3 but intersect 
Fy only when |u| > 3. 
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We consider the case m = 0. Here [ f] = [a] = 0, so a first approximation 
yields nothing. We omit here the calculations of formula (12), which are quite 
cumbersome (see Bruno, 1977), but they show that 


s(fe)=0, (34) 


re) = re) = be* sgn + Ofe*) , b= (35) 


oe 


Therefore, the averaged system (13), obtained from system (20), has the form 
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1 = £2 5 
(36) 


Yo = {y3p(y1) + rM(e)sin yy} . 
For generating solutions, we obtain the equation 
rYe)siny, =0. 
Consequently, there are two families 
{yz =0,y, = 0} and {y, = 0,y; = 7} (37) 
as well as the families 
{y, is arbitrary, e is a root of the equation r(e) = 0} 


According to (35), this equation has the multiple root e = 0. Sarychev, 
Zlatoustov, and Sazonov, [ 1976, § 3.7 and fig. 30; Sarychev, Sazonov, Zlatoustov, 
1977] calculated the function r“)(e), which they called A,,(e), along the interval 
0 <e < 1. According to their calculations, the equation r“(e) = 0 has no roots 
in that interval. Hence, for u = 0, the only generating families are those in (37). 

We will denote the families of periodic solutions which are generated by the 
families (37) as Ay and F,, respectively. As a consequence of automorphism (16), 
F,(e, uw) = F,(e, — py). Hence, it is sufficient to consider just the family Fp. 

For this family of solutions, the characteristic equation of (36) is 


A? = po sgn pr(e)cosy, . 


By (21) and (12), the function r™ sgn p is independent of y, so that the family F 
changes its stability when the sign of y is changed. By (35), Fp is, for small x and 
e, stable when yt: > 0 and unstable when yp < 0. Here, 


Al = p3?e(3-24,/2) . 


All of this agrees with the results of the numerical calculations of F and its 
stability in the work of Zlatoustov, et. al. [1964]. We note that in the case 
considered, m = 0, there is very strong degeneracy (to fifth order in ¢). 
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Part I 
The Sets of Analyticity of a 
Normalizing Transformation 


Preface 


Let an analytic system of ordinary differential equations have an invariant 
irreducible k-dimensional torus 7 of conditionally periodic solutions (k > 0). 
We consider the following problem in a neighborhood of 7: find all invariant 
analytic sets containing 7. 

A new approach to the problem is proposed in this paper: we construct a 
formal normalizing transformation (§ 1), and, using the normal form, find the set 
M on which the normalizing transformation is analytic (§ 2). The results are as 
follows: if there are no small divisors (k < 1), then W = Wy; if small divisors are 
present, then ./ = By < Gy. The set By is distinguished from wy by the 
condition of nilpotency of a certain matrix B. 

The basic results are then applied to a Hamiltonian system (§ 3). In §4 and 
§5 we find the sets of analyticity for the transformations which normalize the 
system on an invariant submanifold. It is shown that the set ./, contains periodic 
solutions previously found by Poincaré, Lyapunov, Siegel, and others. On By 
there are the conditionally periodic solutions previously found by Kolmogorov, 
Arnol’d, Moser, Bogolyubov, and others. 

Key words: normal form, periodic solutions, conditionally periodic solutions, 
Hamiltonian systems. 
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Basic Notation 


Greek and Latin print letters denote numbers, functions, and series; small 
letters are scalars; capitals denote vectors and matrices, the components of which 
are the corresponding small letters, appropriately indexed. A tilde appears under- 
neath those Greek letters which might be confused with Latin letters. Vectors will 
be written as rows, but behave as column matrices in matrix multiplication. 
i= + /-1. 

X = (X1,.--,X4) Y = (V1,---) Via) 

V =(v4,...,0)), W = (Wy,---s Wn) 

Q =(@,,...,@,), the frequency vector. 

A=(A,,...,4,) and M =(,,,...,,,) are vectors of eigenvalues: Re A = 0, 

Rep; #0. 

M = (M_,M,), where Re M_ <0 < ReM,. 
P= (Pi,--+> Pads Q = (41.---51)- 

E, is the jth unit vector, 

E is the identity matrix. 

E = (€,,...,&) 1s a vector of parameters. 

|Qll =, +-+ + 4, and 

1Q| =|ail+°°° + lail- 

Q' > (Q1s---59r)s Q" a (Gr4is+++5 4): 

Q = 0 means that q; > 0 for alij = 1,..., 1. 
X is the complex conjugate to a number x. 

x = dx/dt. 

€Q, AD = 4,4, +°°° + qA,, the scalar product. 
V2 = vfto®... vf. 

C™ is m-dimensional complex space. 

Z” is the m-dimensional integer lattice. 

Ni = {0:Qe2Z50 + E, > 0}, and 
N'=NiuU-U NE 

The remaining sets are denoted by Latin script capitals: 
7 is the k-dimensional initial torus. 

, is the ring of analytic, 2x-periodic functions of a vector variable X. 
Y,(LY]] is the ring of formal power series in Y with variable coefficients 
belonging to Py. 
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PYCLY]] is the ring of convergent series in Ay [[Y]]. 

v,W_,and W, are integral manifolds of a seminormal form: 

¥ =W_OAW, = {U,V,W.E: W = 0}. 

Wy and By are formal sets, x and & their projections onto ¥. 

KH ={U,V,E: V" = 0} is a coordinate subspace of the V coordinates. 
L = £[A,,...,4,] is an invariant coordinate subspace. 

yp = Fly-sos 

B= AVNK B =BOK. 

FJ is a o-parameter family of j-dimensional tori. 


Introduction 


We shall consider the system 
dZ/dt =Z = F(Z,E) , (0.1) 


where Z = (2,,...,2,+1+m) 18 a vector variable and E = (¢,,...,¢,) is an vector of 
parameters. Let this system (0.1) be analytic in some domain @ c C**!*™*" and 
real in the domain Re Y. Suppose that in the domain Re, with E = 0, system 
(0.1) has an invariant, irreducible k-dimensional torus J filled by ‘conditionally 
periodic solutions. The following problem arises: 


Problem 0.1. For system (0.1) in the neighborhood of the torus 7, find all 
invariant sets M which contain F 


This problem can be approached on two levels: 

1) The formal-algebraic level, in which we seek formal power series which 
describe the invariant set .%. 

2) The analytic level, in which we investigate the convergence of those power 
series. 

In turn, the first level has two aspects: 

a) the complex case, in which we seek invariant sets .Z. 

b) the real case, in which we investigate the real parts Re.@ of complex 
invariant sets 4. 

Note that the classical problem of the continuation of the torus 7 for a small 
parameter ¢ is really a special case of problem 0.1. The question of bifurcations 
of families of such solutions under a variation of parameters is likewise a special 
case of this problem. Considerable effort has been devoted to solving special cases 
of problem 0.1. 

We present here a new approach to this problem: we construct a formal 
normalizing transformation for system (0.1) and then find the sets of analyticity 
of that transformation. In §§ 1 and 2, we present this approach in greater detail 
and formulate some fundamental theorems. In §§ 2—5, we derive various special 
results which follow from these general theorems. Nearly all of the references 


282 Introduction 


treat these special results. On the one hand, there are theorems on the continua- 
tion and existence of periodic solutions and on bifurcations of families of such 
solutions (Poincaré, Lyapunov, Siegel, Meyer, Henrard, etc.). On the other 
hand, there are theorems on the existence of conditionally periodic solutions 
(Kolmogovov, Arnold, Moser, Bogolyubov, etc.). This second group of results is 
connected with the problem of the effects of “small divisors” which are not 
addressed at all by the first group. 

Let us briefly describe our new approach. We limit ourselves to considering 
a torus 7 for which the system on the torus and the variational system are 
reducible (restrictions 1 and 2), and such that among the eigenvalues of the 
variational system there are / with vanishing real part (denoted by 4,,...,4,) and 
m with non-zero real part (denoted by y,,..., 4,,). We construct a formal trans- 
formation which puts the original system into a seminormal form, which becomes 
a normal form when m = 0. We use a variant of the seminormal form which 1) 
facilitates our investigation of the real parts of the invariant sets and 2) gives us 
the analyticity of the formal sets under minimal requirements. We pose a second 
problem for the seminormalizing transformation: 


Problem 2.2. Find the sets of analyticity of such a transformation. 


The answer to this problem is something like this: In the absence of small 
divisors, the transformation converges on some set y which is composed of 
stationary, periodic, and asymptotic solutions. If there are small divisors, then 
the transformation converges on some subset By of Ay. The set By consists of 
conditionally periodic and asymptotic solutions. In this connection, the sets 4, 
and 4, are defined by equations which are derived from the normal form. The 
set W%y is described by k + | equations, while the subset Ay is distinguished by 
the fact that a certain matrix B of order k + 1 is nilpotent on My. Therefore the 
fixed and periodic solutions are easily continued with respect to a parameter, 
while the continuation of conditionally periodic solutions requires either a 
large number of parameters or that the system possess some property which 
guarantees the nilpotency of the matrix B. Invertible and Hamiltonian systems 
have such properties; so most results on the existence of conditionally periodic 
solutions apply to Hamiltonian systems. The main results of this work are briefly 
given in the article [Bruno, 1975]. 

Before we proceed with our systematic presentation, let us make two remarks: 

1. It is often said that the conditionally periodic solutions on a k-dimensional 
torus 7 constitute a k-parameter family of solutions. Since each of those solu- 
tions is everywhere dense on 7, the parametrization of solutions is not regular. 
Hence, such a set of solutions cannot be called a family. The term family is 
reserved for a collection of invariant sets in which the dependence on parameters 
has a regular character (here, analytic or formal). We consider here only families 
of invariant tori, none of which is called a family if it is irreducible. 
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2. Some of the results discussed here were initially obtained for analytic 
systems, but were than carried over to sufficiently smooth systems (0.1) [Moser, 
1962; Kelley, 1967a, 1967b, 1968a; Riissmann, 1970; Bogolyubov, Mitropolsky, 
Samoilenko, 1969]. Apparently, the following general principle is valid: for every 
invariant analytic set of an analytic system, there is a corresponding smooth 
invariant set of a smooth system. It thus appears that all the results presented 
here are equally applicable to sufficiently smooth systems. 
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1.1. Statement of the Problem 


Suppose the autonomous system 
Z=F(Z,4), (1.1) 


where Z = (21,...,Zz+14m) iS a vector variable and A = (6,,...,6,) is a vector of 
parameters (4 = 0), defined and analytic on some domain of the (k +/+ 
m + n) dimensional space C**'*™*" and real in Re G. Let the k-dimensional torus 
ZF — Re be an irreducible integral manifold of system (1.1) for 4 = A°. That is, 
the torus J consists of complete solutions of system (1.1), any of which is 
everywhere dense on 7. In what follows, we will call such an integral manifold 
an initial torus [Bruno, 1973]. Our task (problem 0.1) is to find, in a sufficiently 
small neighborhood W c @ of the torus 7, all integral analytic sets of system 
(1.1) which contain 7. Our attention will mainly be focused on those real, 
integral, analytic sets which pass through 7. Our presentation has been designed 
to facilitate the study of these real sets. 


1.2. Local Coordinates 


We now introduce local parameters, E = (€;,...,€,) = 4 — A°. We will call 
functions x,(Z, F), ...,x,(Z, E), y,(Z, E), .... Yiem(Z, £), defined and analytic in 
some complex neighborhood of J, local coordinates if, on the torus 7 itself: 


I y. =7* = Yim = 9; 
2) the Jacobian, det(é(X, Y)/éZ), does not vanish; 
3) the functions x; have period 2: x; + 2x = x,,j=1,...,k. 


Then the neighborhood of the torus 7 in the region Re@ is a fibre bundle of 
the torus {X mod 2z} on the (J + m)-dimensional ball {|¥| < ¢)} and the n- 
dimensional ball {|£| < ¢9}; in this neighborhood, the torus is defined by the 
equations Y = 0, E = 0. Such local coordinates exist for every torus 7 < ReY. 
In these local coordinates, system (1.1) becomes 

X = OX, YE), 
(1.2) 

FS OOO, Ey: 
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where the 9 are functions which are analytic for | Y| < & , |E| < & and 2z- 
periodic in X. Since the torus 7 is an invariant manifold, 9 (X,0,0) = 0 and 
on the torus 7 system (1.2) has the form 


X = O)(X,0,0) . (1.3) 


By supposition, all solutions of system (1.3) are conditionally periodic with k 
basic frequencies. For example, the system 


X =Q =(q,,...,@,) = const (1.4) 
has such solutions if 
(P,Q) # Ofor all Po Z* , P#0. (1.5) 


Restriction 1. Let system (1.3) take the form of system (1.4) in suitable local 
coordinates. 


As a consequence of the irreducibility of solutions on the torus 7, property 
(1.5) will be satisfied. If k > 1, then such coordinates X do not always exist on 
7, From now on, however, we will only consider initial tori which satisfy 
restriction 1. Note that when k = 0 (7 is a stationary point) ork =1(7 isa 
periodic solution), the restriction is automatically satisfied. 

We now consider an approximation, linear in Y, to the second subsystem of 
(1.2) for E = 0: 


Y=A(X)Y, (A= 00/8Yat Y=0,E=0). (1.6) 


This is the so-called variational system for the integral manifold 7. 


Restriction 2. In suitable local coordinates, let the matrix A(X) in system (1.6) 
be constant. 


This restriction might be replaced by the weaker requirement of analytic 
reducibility of system (1.6) to some “linear normal form”. But we will continue 
to use restriction 2 for the sake of simplicity. In this connection, we can assume 
that the constant matrix A has already been put into some normal form — say, 
Jordan form. The coordinates Y can be complex for real Z. 

If k =0, then 7 is a point Z = Z°, local coordinates are Y = Z — Z°, and 
the matrix A is always constant. Ifk = 1,7 isa periodic solution, and restriction 
2 is always satisfied for complex coordinates, Y, but not always for real Y [this 
is the Floquet-Lyapunov theorem; see Pontryagin, 1961,§ 19]. Fork > 1, restric- 
tion 2 is generally not automatically satisfied, even in complex coordinates. 

Thus, for an initial torus 7 which satisfies restrictions 1 and 2, system (1.1) 
becomes, in suitable coordinates, 


X= 2+ Z(X,¥,£), 
; (1.7) 
¥ = AY + Z°X,Y,E) , 
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where the functions Z"? and Z™ are analytic for | Y}, ||, |Im X| < é9, the matrix 
A has normal form, and 

Z)=0, Z%=0, oZeY=OatY=0, E=0. 


Moreover, real values of the coordinates Z and 4 correspond to real values of 
X and E and to those values of the Y coordinates which satisfy the reality relation 


Y=KY, 


where K is some constant matrix and Y denotes the complex conjugate coordi- 
nates to the Y coordinates. The reality of the original system (1.1) implies the 
following properties: 


ZX, KY, E) = Z(X, YE) , 
K“AK=A, (1.7’) 


K*Z(X, KY, E) = ZX, ¥,E) . 


1.3. Analytic and Formal! Entities 


Let a function f(X) be analytic and 2z-periodic for sufficiently smal] |Im X|. 
It can be expanded in a Fourier series 


f= y Spexpi¢P, X> ’ 
PeZ* 


where the coefficients fp decrease like exp(—c| P|) as |P| increases, where c is some 
positive constant. All such functions f form a ring, which we denote by Y,. This 
ring is closed with respect to differentiation; that is, if fe Py, then Of/dx; € Py. 
We denote by A, [[Y]] the ring of formal power series 

f= LY folX¥?, (1.8) 

0<QeZitm 

where Y2 = y?!... ydiem and So € Px. Clearly, every series in this ring has a unique 
Taylor-Fourier expansion 


f =D feo YP expi<p,X> . 


We will call series (1.8) convergent if there exists some positive number & > 0 
such that the series 


D | fol ys l™ ---Lyreml 
converges for all X and Y in the neighborhood 
[Im X| < & , l¥|<e. (1.9) 


The convergent power series (1.8) also constitute a ring, which we denote by 
PX[LY]]. Both rings, 2[[Y]] and A,[LY]], are closed relative to differentia- 
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tion with respect to both X and Y. The sum /(X, Y) of a convergent series (1.8) 
is analytic and 2x-periodic in X in the neighborhood (1.9). 

A set .@ of points X, Y is called an analytic set on the torus Y = 0 if there 
exists a neighborhood (1.9) in which the set .@ coincides with the set of solutions 
of some system of equations 


fia =f=0, (1.10) 


where f,¢ Ay[[Y]] and f, = Oat Y = 0 [see Fuchs, 1961]. An ideal ¥ with basis 
fi...» f, in the ring AL[LY]] corresponds to -@. If fe A[LY]], we say that 
the system of equations (1.10) defines a formal set @ for which there is a 
corresponding ideal , with basis f,,..., f, in A LY]]. The set % will be 
analytic if there is a basis f,,..., f.€ PO[LY]]. A power series h e Py[[Y]] is 
said to be convergent on a formal set .@ if there exists a convergent series 
g € PALCLY]] such thath —ge ¥. 

The set .@ is called a (local) manifold if, under a suitable indexing of the Y 
variables, the set is defined by a system of equations 


Vp = G(X isos Ym), jom +i,...,l+m, 


where the g; € Ax[Ly1,---5¥m]] and m' < 1+ m. A manifold .@ is either analytic 
or formal, depending on whether the series g; are convergent or not. 

Returning to system (1.7), we note that its right-hand sides belong to 
PALLY, EJ]. A set is determined by the system (1.10), where the S,¢ AULY, E]]. 
A formal set (1.10) is said to be an integral (or invariant) if 


(Fz) + (Fz) =0, jelewr, (1.10') 


on that set; that is, ifin the ideal 7, the equations (1.10’) are identically satisfied. 
In particular, a formal manifold is determined by a system of the form 


Vi=Gi i=m'+1,...,l+m, 
(1.10”) 
E=h, , J=n+touywn, 


where gj, h; € Py LLV1,-- ++ m3 €1s-++s&n ]]. This will be an integral manifold for 
system (1.7) if, after making the substitutions (1.10”), the equations 


m' 


()) = (3.2%) + Y 095 aa , i=m'+1,...,l+n 


rer OY, 
ah, Oh; 
=(s2,29) 4 Y Step jan'+,... 
0 (eZ d+ d aut , j=n'til,...,n 


become identities in the ring Ay[Ly1,~--, Van3 Ets--+9 En 1): 
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1.4. The Normal Form 


Among the / + m eigenvalues of the matrix A let there be / of them, 4,,..., 
A,, With zero real part (Re 4; = 0) and m of them, ,, ..., 4, with non-zero real 
part (Re y1; # 0). In this section we consider the case m = 0, so that 4,,..., A, are 
all of the eigenvalues of A. Since A has a normal form, its main diagonal is 
A =(A,,.-.,4,). We now try, with the help of a nonlinear change of local 
coordinates, to simplify system (1.7). We make a formal change of coordinates 


X=U+5U,V,E), 
(1.11) 
Y=V+H(U,V.E), 


0H 
where the series €,, n;€ A,[[V,£]] and Z=0, H =0, and AV =OatV=0, 


E=0. Such a formal transformation is invertible. It takes system (1.7) into a 


formal system 
U = @(U,V,E) , 
; (1.12) 
V= YU,V,E), 


where gy, ¥;€ AyLLV, E]]; @(U,0,0) = 2; Y(U,0,0) = 0; and 0¥/aV = A at 
V =0, E = 0. We set 


9; = ¥,/v; = ¥ dros V2E* expi<P, UD ; (1.13) 


here, P runs over Z*, S runs over the first orthant of Z" (S > 0), and Q runs over 
a set Nj c Z' such that 


Q+E,20, |Qle0, j=1,...,! 


[see Bruno, 1971, introduction]. We write N' = Ni, U--- UNI. For @ we use the 
usual expansion 


® = ¥ Dogs V2ESexpi<P, UD , (1.14) 


where Pe Z*,0<QeZ',0<Se 2". Recall that the components of the vector 
A are the eigenvalues of the matrix A and that A is the principal diagonal of A. 
We will call system (1.12) a normal form, if, in expansions (1.13) and (1.14), the 
only non-zero coefficients gipgs and ®pgs are those for which 


i<P,Q) + (O,A) =0. (1.15) 
Let Q € N! be a fixed vector. We define 


InjicP, 2> + <Q, A>| 
|P| : 


9(Q) = lim 
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where the lim inf is taken for all P ¢ Z* which do not satisfy equation (1.15) as 
|P|— co. 


Restriction 3. y(Q) > 0 for all Qe N'. 


Note that for k = 0 or k = 1, this restriction is met automatically. But when 
k > 1 and / > 1, this is an added restriction on the vectors 2 and A. In some 
cases this restriction can be replaced by a weaker restriction 3’ (see § 5). 


Theorem 1.1. For system (1.7) with m = 0, there exists a formal transformation 
(1.11) to the normal form (1.12). 


For the proof when k = 0, see Bruno [1964; 1971, § 1]; for k = 1, see Bruno 
(1971, § 11]. For k > 1, the proof is similar. We now note a few properties of the 
normal form and the normalizing transformation. 

t. The advantage of the normal form (1.12) over the original system (1.7) is 
that expansions (1.13) and (1.14) contain only resonant terms. Hence, the solution 
of system (1.12) reduces to the integration of a system of reduced order [see 
Bruno, 1971, §2]. 

More specifically, there exists a change of coordinates of the form 


5, = v8"... v8texpiCN,U>, i=1,...,1 (1.15’) 


with rational «;; and v,;, which transforms the normal form (1.12) into a system 


ip? 
U = B(ij-441,---,5,E) ’ 
. (1.15”) 
(In 5) = G(Pi-a+15---,2,£) , j= Wael, 


where dis the number of linearly independent solutions Q ¢€ Z', P € Z* of equation 
(1.15). The solution of system (1.15”) reduces to the solution of the subsystem of 
order d defined by the last d equations. For every solution of this subsystem, the 
values of the variables U and 0,,..., 0,_4 are found by quadratures. If we denote 
by J, the constant term in g,, then 


A=KA+iNQ , (1.15’”) 
M 

where we have introduced the matrices K =(x,;) and N= : | = (v,;). 
\N, | 

Also 4,441 =*': = 4; = 0. Generally speaking, transformation (1.15) is defined 


away from the coordinate subspaces (that is, for all v; 4 0). 
2. If, under a suitable indexing of the variables, the inequalities 


4, #iCP,Q+qQ At +a, Fal tdy...,1 (1.15) 


are satisfied for some /’ < J and for all P € Z*, and for all integers q,,..., q; > 0, 
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then the coordinate subspace 
dpa, SS, =O 


is an integral manifold of the normal form (1.12). We denote this manifold by 
L = ¥[A,,..., Ay]; it is a formal integral manifold of system (1.7). 

3. Generally speaking, the normalizing transformation (1.11) is not unique 
(Bruno, 1971, § 1]. 

4. If system (1.7) satisfies the reality conditions (1.7’) then there exists a 
normalizing transformation which preserves the reality relations 


in system (1.12). 
5. Automorphisms. If system (1.7) is unchanged by the linear change of 


variables 
X=MYX, Y=M2yY, t=6t, 


then there exists a normal form (1.12) which is unchanged by the analogous 
change of variables 


C=M%U, V=M?V, tao. 


In particular, the normal form of an invertible system (1.7) is likewise invertible. 

6. Small parameters £ do not change under a normalizing transformation 
[see Bruno, 1973, 1974b]. They can be considered local coordinates satisfying 
the trivial system E = 0. 

7. If condition (1.5) is not fulfilled, theorem 1.1 still holds. That is, it is possible 
to put a system into normal form in the neighborhood of any invariant torus 
which satisfies restrictions 1, 2, and 3. Also, the initial torus may be stratified into 
invariant tori of lower dimension. It is necessary only that motion on the torus 
be described by a system (1.4), with Q arbitrary. However, in this work we always 
assume that (1.5) is satisfied. 

8. Likewise, if m > 0, then it is possible to normalize system (1.7) if restriction 
3 is met. Here, A is the vector of all the eigenvalues of the matrix A. Such a 
transformation yields the greatest possible simplification of the original system 
(1.7), but it converges under extremely strict conditions. Therefore, when m > 0, 
we will put the system into a seminormal form which is not as simple as the 
normal form but for which the seminormalizing transformation converges under 
less strict conditions [this was observed by Bruno, 1971, § 9]. 


Example 1.1, Let us find conditions on Q, 4; and py; under which the normal 
form is necessarily a linear system 


U=2, V=AMN, 
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where by V we mean the vector (0,,...,0;4,,). Clearly, property (1.5) must be 
satisfied, and equation (1.15), with 


A =A = (Ay... Ags My s+ +-s Um) 


can have no solutions Pe Z*, Qe N'*", ||O|| > 1. For this to be true, it is 
sufficient that any Q € N'*”, ||Q|| > | satisfy the inequality 


<Q,ReA> #0. 
If! =O and Rep,,..., Reu,, < 0, this can be assured by requiring that 


Re(u; + uj — Hy) <0, ij,k=t,...,m. 
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We now consider the case m>0Q. Let m_ and m, be the numbers of 
eigenvalues, 4;, ..., Mm, With negative and positive real parts, respectively 
(m_ + m, =m). Since the matrix A has normal form, then it has block-diagonal 
form after a suitable indexing of variables Y: 


A={Ay,A_, Ay} . 


where the blocks Ay, A_, and A, are matrices, the eigenvalues of which have 
zero, negative, and positive real parts, respectively. Moreover, these eigenvalues 
lie on the diagonals of the respective blocks: 


diag Ay = A =(A,,...,4)) , 
diag A_ = M_ = (My,...5Um_) 5 
diag A, oo M, aj (Mm +19+++>Hm) % 
We will divide every vector W = (w,,...,W,) into two parts: W_ = (w,,...,Wm ) 
and W, = (Wa tis-++> Wn): 
We now apply to system (1.7) a formal change of variables 
X =U + S(U,V,W,E) , 
y= +n(U,V,W,E), i=1,..., l, (1.16) 
Yap = Wt nr f(UVWE), jal.wm, 
where V = (v,,...,0,), W = (Wy,-.-, Wm), and all the series €; and n; belong to 
ALLY, W, E]); also, 7 = 0, H = 0, 0A/OV = 0, 0H/eW =0 at V=0, W=0, 


E = 0. Such a formal transformation is invertible. It takes system (1.7) into a 
formal system 
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U = D(U,V,E) + B(ULV,W,E) , 
V = WU,V,E) + Y(U,V,W,E) , 
: . (1.17) 
W_ = X_(U,V,W_,E)+ X_(U,V,W,E) , 
W, = X,(U,V,W,,E) + X.(U,V.WE) , 


where all the series in the right-hand sides belong to A, [[V, W, £]]. Recalling 
that W = (W_, W,), we take 
=0, P=0 forW=0; 


X¥_=0 forW, =0; (1.18) 
x, =0 forW_=0; 


ov Ox OX 
= Ay, => = 0, and — = {A_,A,} wh pa 
ay 3y , an aw! } when V = 0, 


and @=02,¥=0,X¥ =0 
W = 0, and E = 0. 

The series © and ¥ in (1.17) are independent of W; they can thus still be 
expanded as (1.13) and (1.14). The series X_ and X, are independent of W, and 
W_, respectively; let us consider the expansion 


4 = why = 0; Y MyporsVOWRESexpi<P,U) , f= 1,...,m, (1.19) 


where Pe Z*,0<QeZ'.0< REN, and0<SeZ". 

We call the system (1.17), (1.18) a seminormal form if: 

1) The expansions of ® and ¥ have the properties of a normal form; that is, 
the coefficients in the series (1.13) and (1.14) vanish except when their indices 
satisfy equation (1.15). 

2)$6=0,%=0,¥ =X. =OforW. =0; 

}=0,8=0,¥, =H, = 0 for Wy. = 0. 
3) The coefficient hjpgrs in expansion (1.19) vanishes unless ¢R, Re M> = 0; 
that is, 
<R_,ReM_>=0, R,=0 forj<m_, 
: (1.20) 
R_=0, «R,,ReM,>=0 forj>m_. 


The seminormal form for m, = 0 and k = 0 was introduced by the author 
[Bruno, 1971, § 9]; the normal form is a special case of that situation. 


Theorem 1.2. For a system (1.7) which meets restriction 3, there exist a formal 
transformation (1.16) to a seminormal form (1.17), (1.18). 


We note a few properties of the seminormal form (1.17): 
1. The following formal manifolds are integral manifolds 


1.5. The Seminormal Form" 
W_={U,V,WE: WwW, =0} , 
W, ={U,V,W.E: W. =0} , 
v ={U,V,WE:W=0}. 


2. Clearly, 17 =W_AwW,. On the integral manifold #, system (1.17) 
reduces to system (1.12), a normal form. 
3. On the manifold W_, system (1.17) becomes 


U = @(U,V,E) , 
V = WU,V,E) , (1.21) 
W. = X_(U,V,W.,E) , 


which is a seminormal form in the sense of §9 [Bruno, 1971]. Let the variables 
W be ordered in such a manner that 


Rep, <-::< Ren, <0< Repu, 4, <°*: < Rey, . (1.22) 


Then in system (1.21), the x; depend only on those w, such that either i <j < m_ 
orj <iand Rey; = Rey,. That is, X_ has quasitriangular form [see Bruno, 1971, 
§ 2]. Specifically, if for some j < m_ 


Rep; < Reps ; (1.23) 
then the manifold 


W; = {U,V, WE: W, = 0, Wy = 07° = We = 0} 


is a subset of W_ and is an integral manifold for system (1.21) and, consequently, 
for system (1.17). 

Similarly, for any j > m_ for which equation (1.23) holds, there is a formal 
integral manifold of system (1.17), 


W, = {U,V,W,E: W_ =0,We_ 4 = = Ww, =O}, 
which lies on #,. Clearly, the following relationships hold: 
WoW, oW_, 0<i<j<m_=; 
W.>2W2W;, m <i<j<m. 


4. The seminormal form still has all properties of reality and invariance with 
respect to linear automorphisms which the original system (1.7) might have 
possessed (see properties 4 and 5 in section 1.4). 

5. The real parts of the manifolds W_ and W,, are filled with solutions that 
approach the manifold ¥ as t > +00 and tf > — oo, respectively. 
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6. Let .& be some set on the manifold ¥”. We take 
My ={U,V,WEUVE~€ M) , 
MM =Myrnw_, M, = My NW, , 
Mp = My OW; . 


If the set .@ is invariant for system (1.12), then. #_,.#,, and .@, will be invariant 
sets of system (1.17). Generally speaking, however, @y will not be an invariant 
set. The solutions in the sets Re.@_ and Re.#, asymptotically approach solu- 
tions in.# ast > +00 and t + —oo, respectively. 

In the following analysis the role of the W coordinates is insignificant, and 
we recommend that they be ignored (assume m = 0) on first reading. Funda- 
mental problems are solved in the U, V, E coordinates, without reference to the 
W coordinates. 

We will call system (1.12), obtained from the seminormal form (1.17) by setting 
W = 0, the normal form of the original system (1.7) with m 4 0. 


§2. Questions of Convergence 


2.1. Convergence in an Entire Neighborhood 


If a normalizing transformation (1.11) (or seminormalizing transformation 
(1.16)) converges for sufficiently small |Im U|, | V|, |W], and |], then it establishes 
in some neighborhood of the initial torus, a one-to-one correspondence between 
the solutions of the original system (1.7) and those of the simplified system (1.12) 
(or (1.17)). However, there does not always exist such a neighborhood of the 
initial torus J, i.e., a neighborhood in which the transformation mentioned 
converges. There thus arises: 


Problem 2.1. Under what conditions on the seminormal form (1.17) does the 
normalizing transformation converge in some neighborhood of the initial torus? 


A more traditional statement of the problem asks about the convergence of 
the normalizing transformation in the sense of property 8 of section 1.4. 

Conditions on the normal form which guarantee the convergence of the 
normalizing transformation have been found for k = Oandk = 1, and it has been 
shown that these cannot be improved upon [Bruno, 1967, 1969, 1971]. Fork > 1, 
only the problem of transformation to a linear system has been considered 
[Belaga, 1962; Bogolyubov, Mitropolskii, Samoilenko, 1969, Ch. VI, theorem 
25], as in example 1.1. 

The problem of the convergence of the seminormalizing transformation for 
k = 0 and m, = 0 has been addressed by the author [Bruno, 1971, §9]. Trans- 
formations to forms which possess individual properties of the seminormal form 
have been treated by Kelley [1967b, 1968b, 1969] and Bibikov [1970a, 1970b, 
1973b]. 

We now formulate an answer to problem 2.1. We take 


a, = minji¢P, Q> + <Q, A>| for Pe Z* , 
QeN', JQ] 20, iKP,2>5+ (0,40 #0, 
[PLS Qi <2) fede. 
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We then write 
a= info; , bea) ees 


Condition «: « > 0. 
Condition B: 8 > —oo. 


If condition « is satisfied, then “small divisors” are absent. Condition a is 
equivalent to saying that the numbers iw,, ..., ia,, 4,, ..., 4, are mutually 
comparable. If condition f is satisfied, then there may be small divisors, but they 
are not too small. If condition « is fulfilled, then so is condition f, since B > 
Ina > —oo. In §5 of Bruno [1971] condition « is called condition ¢. Condition 
B for k = 0 was introduced by the author [Bruno, 1967] as “condition «”, and 
later [Bruno, 1971] for k = 0 and k = 1 as “condition w”. 

Normally, we place limits on the smallness of divisors by employing Siegel’s 
condition, which can be written in our notation as 


a> c2™ 5. FS Baas (2.1) 


where c and v are positive constants. If this condition is fulfilled, then 


“Inc — vjln2 . 
p>y = 7 =Inc vin2 )’ 


j=1 j=l 


j ; 
57> —23 


that is, condition £ is fulfilled. We also note that under condition B, restriction 
3 is automatically met. 


Condition A. There exists a series a(U, V,E) € A,[[V, E]] such that in system 
(1.17) 
9; = 0,a(U,V,E) , Pad, eg Kies 


Wj; = vg; = v,A,a(U, V, E) {oa ? Aen 
Theorem 2.1. If, in an analytic system (1.7), Q and A satisfy condition B and if 


the seminormal form (1.17) satisfies condition A, then the seminormalizing trans- 
formation (1.16) is analytic in some complex neighborhood of the initial torus 7. 


This theorem is proved for k = 0 and m, = 0 in §9 of Bruno [1971] and is 
formulated for k = 1 and m = 0 in § 11 of the same work. 


2.2. Convergence on Sets 


Note that condition f is an extremely weak limitation, since it is satisfied by 
almost all Q and A [Siegel, 1952]. In contrast, condition A imposes an extremely 


2.2. Convergence on Sets es ee 


strict limitation on the right-hand sides of the seminormal form, and requires 
k + 1—1 linear relationships among the series ® and ¥. The author has shown 
[Bruno, 1971, theorem III] that condition A cannot be weakened. We therefore 
pose a second problem. 


Problem 2.2: For a seminormal form (1.17), find all sets M which are analytic 
on the initial torus Z and on which the seminormalizing transformation (1.16) 
converges for any given analytic system (1.7). 


Let us clarify this. As we know, the set of points of convergence of a power 
series of one variable, f(y), consists of a disk |y| < p and some of the points of 
its circumference |y| = p. The set of points of convergence is of a more compli- 
cated nature for a power series in several variables. For example, the series 


Vy s kt y§ 
k=0 


converges only on the axes y, = 0 and y, = 0 and diverges at any point y, # 0, 
y, #0. In the terminology of section 1.3, problem 2.2 can be formulated as 
follows: for a seminormal form (1.17), we seek those formal sets .4 on which the 
series ¢; and n; in the seminormalizing transformation converge. The convergence 
of the series 5 and H from (1.16) and ®, ¥, and X from (1.17) on a set .4 (or 
modulo its ideal, f) is proved all at once; as a consequence, we obtain the 
analyticity of the set .@ itself. 

We will consider the equalities of condition A as equations which define a 
formal set .7,. That is, 


ty ={U,V,W,E: ® = Qa,W, = Ayyjaj = 1,...,0) . (2.2) 


Here, a behaves as a free parameter. It can be eliminated from the equations, so 
that Wy is defined by the system 


Big Mi be dick 
wo, Av; ” eee 
4,40, l<j<l; (2.3) 


Wp=0, ifdap =O. 


Theorem 2.2. If condition « is satisfied for system (1.7), then the seminormaliz- 
ing transformation (1.16) is analytic on the set oy, which is itself analytic. 


In this theorem, condition « excludes the presence of small divisors; under 
condition (1.5) this is possible only if k = 0 or k = 1. We now consider the case 
of small divisors. Let L = {A,,...,2,} be the diagonal matrix formed from the 
diagonal of the matrix Ay. We consider the square matrix of order k + 1 
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ab =a 
au 

B= (2.4) 
oy aw 
a eke 


on the set .%,; here a is just the parameter which appears in the equations (2.2) 
which define the set %,. We define the formal set #,, a subset of A,, as that 
set on which the matrix B is nilpotent. That is 


By = {U,V WE: U,V,W,E€ ty, BR = 03 . 


Theorem 2.3. If condition 8 is satisfied for system (1.7), then the semi- 
normalizing transformation (1.16) is analytic on the set By, which is itself analytic. 


Thus, in the presence of small divisors, the nilpotency of the matrix B is 
necessary for our transformations to converge. The author has noted the neces- 
sity of this sort of condition in the problem of a normalizing transformation 
(Bruno, 1967, condition A,; see also Bruno, 1971, $8]. 

Note that both sets 4, and #y are defined only with respect to U, V, and 
E; the W coordinates can be arbitrary on those sets. It is therefore sufficient to 
investigate the properties of the sets 


B=AyAv¥ andB=Bynv . 
Example 2.1. We show here that theorem 2.1 is a special case of theorem 2.3; 


that is, under condition A, the matrix B is nilpotent for every U, V, and E. In 
fact, under condition A, 


en da OM _ Oa 
du ‘aU’ av ‘av’ 
ow, da Ow, da 
hi = Avisg ‘ ay = AjaE, + Avy : 
that is, 
@, 
pu| % da da Ga_ a 
A,v0,] \Ou, Ou, dv, dv, 
Avy 
Consequently, 
k t 
da da 
B? = 70) = +) =a) 
i Om fat OM (2.5) 


=B )' (iKP,2> + (Q,A))apqs VES expicP, UD , 
P,Q,S 


2.3. Properties of the Set 


where a = ¥ apgs V2ES expi<P, U> is the power series expansion of a. Since this 
expansion contains only resonant terms for which the exponents P and Q satisfy 
equation (1.15), it follows that the sum (2.5) over P, Q, and S vanishes identically. 
Consequently, B? = 0 for any U, V, and E. 


Example 2.2. In connection with the preceding example, we might ask: is it 
possible that 2 4 .? We show here that it is possible. We consider the normal 
form 

wm=l, 
i, = /2, 
0, =0,(0, — &) . 


For this system, k = 2, = 1, m=0,n = 1, A, = 0, and the set .o is defined by 
the equation v,(v, — €,) = 0; that is, it consists of two components 


of) = {U,v,,8,: 0, = 0} ? = {U,v,,6:0; =&,}. 
Here, the matrix B is 
‘0 0 Oo. | 
[« 0 0 | 
0 0 20, -& 


The set & is distinguished from the set ./ by the additional equation 2v, — ¢, = 
0. Examining this equation on each of the components ./! and .o/?, we see that 
B = {U,v,,8,: 0, = &, = O}. That is A= 7 4 A. 
2.3. Properties of the Set / 
We consider here the properties of the set oe = %y OV. It is defined by the 


normal form (1.12), which, for simplicity, we will assume is analytic. 


2.3.1. General Properties 


1) Let 
b(U, V,E) = ¥ bpgs V2 ES expi<P, UD 


be a series containing only resonant terms. That is, bpgs # 0 only when equation 


(1.15) holds for the indices P and Q. We will show that, on the set x~, 


In fact, 


On the set ./, we have 
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k ! 
° . wl L te | 0b 
sas Or alae aa 
ate y; (iXP,Q> + <Q, A>)Dpgs VES expiCP, UD . 


P,Q,S 


Since P and Q always satisfy equation (1.15), the term in parentheses vanishes. 
The assertion is proved. 

2) The set .o is invariant in system (1.12). In fact, by (2.3), the set is defined 
by equations of the form 


g/o,= 9/4; and g;=0. 


We need to verify that differentiation with respect to t does not cause these 
equations to be violated on .W. But all of these series contain only resonant terms. 
Thanks to the preceding property, we know that the derivatives of such series 
vanish on the set .. Consequently, on /, 


9/0, = G/4,;=9 and g,=0. 


3) The value of the parameter a is fixed on each solution in .o/. In fact, 
a= ¢,/w, on &, and, as we showed above, da = 0 there. 

4) Let a point U°, V°, E° be in , and let a = a° at this point. From the 
preceding property, we know that any solution which passes through the point 
U°, V°, E° satisfies the system 


* * 0 — 
U =Qa° , b= Apa, jell. 
From this we obtain 
U=2a°t + U°, 


(2.6) 
v,=vPexp(4ja°t), jel... t. 


5) Let d be the number of linearly independent solutions P « Z*, Q € Z' of 
equation (1.15). In the set 


W* = {U,V,E: 0, #9,...,v, # 0} 


we can, in accordance with property 1 of section 1.4 introduce coordinates V, 
such that the normal form (1.12) takes the form (1.15”). Then the set ./* = 
OY * is defined by the system of equations 


P(5,-441,---.5,, E) = Qa 
- (2.7) 
Oizdwwatey= 7s - “Poteen ie 


Since 4,_4,, = ++: = 4, = 0, then we can separate from system (2.7) a subsystem 


2.3. Properties of the Set ST 


G@i-ass-- 8 E)H0, fal-dtt..t. (2.8) 


That is, rest points of the subsystem of the last d equations of system (1.15”) 
correspond to solutions in the set *. This is not true for all rest points of this 
subsystem, but only for those at which the functions @ and §,,..., G4 take on 
the values Qa and /,a,..., 4,44, respectively. Thus, the set ../* is defined by k + | 
equations in the d + n + 1 variables 0,_4,,,..., ;, £, and a; that is, there are 


o=dtn+1—-(k+) 


free parameters on the set o/*. If o < 0, then »/* is the empty set; if o > 0, then 
* is a o-parameter family of solutions. Note that o can always be made positive 
by increasing n, the number of parameters. 

6) In the preceding arguments, we tacitly assumed that among the numbers 
Wy, +115 Des Ayy --+y Ay, at least one was non-zero, and that the parameter a was 
defined on the set ¥*. We now consider the case k = 0, A, =+:: = 4, = 0; that 
is, d= 1. In this case we can assume that a = 1, and the set .o is defined by 
k + | = d equations in d + n unknowns. That is, the number of free parameters 
on &f is 

go=n=dimg#. 


The set .o itself consists of the rest points. 


2.3.2. The Set . on Subspaces 


Let 0 <1’ < I; we divide every vector V = (v,,...,v;) of dimension | into two 
subvectors V’ = (v,,...,vp) and V” = (v,-4,,..., ¥,) of dimensions of I’ and | — I' 
respectively. Let 

KH ={U,V,E: Vv" = 0} (2.8) 


be a coordinate subspace of a manifold ¥. Along with the set ./ we will also 
consider the set .o/’ = oJ 7 #. If equations (1.15””) are satisfied, then # is an 
integral manifold of system (1.12), which in this case we will denote by Y. 

7) The intersection of . with any coordinate subspace 4% is invariant in 
system (1.12). This is evident from solution (2.6). The set /’ = /% OX is given 
by the system of equations 


b = Qa : (2.9) 
W=adya, joist, (2.10) 
Pr’ =0, (2.11) 


66 99 


where means that the functions are evaluated at V" = 0. Let d be the number 
of linearly independent solutions P € Z*, Q' € Z" of the equation 


iKP,Q> + <O'",A’> =0. 
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We write 
KH*= {U,V,E: v" =0;0, 0,7 = 1,...,17} * 


According to (2.6), solutions from the set o’* = JA H* haver’ =k+l—d 
independent frequencies. We call the number r’ = r’(.#) the irrationality index 
of the subspace # . By virtue of (1.5), 


keor(A)sk+l’. 


If ¢) > #, then (XH) > r'(H). 

8) On the integral manifold #, we have " = 0. Therefore, the set .o/’ = 
of cy & is defined by the system (2.9), (2.10) above. Let us apply a transformation 
to the V’ variables like that in the discussion of property 1, § 1.4. We find that the 
number of parameters on the set .o/’* = dn £* is 


; d@t+nt+1—-(k+l)=n41—-r', ifr 40, 
a= . 
n, ifr’=0. 


Specifically, we always have that a’ < n; that is, for “general systems” the set of 
reduces to the initial torus 7 unless the number n of parameters is positive. If 
¢' > F?, then 

o(L')<a(L?), 


hence, it always happens that o < o’. 


2.3.3. The Real Part 


We now consider the real part of the set .o/, (see property 4 in § 1.4). 

The parameter a takes real values on the set Reo. 

9) All solutions in the set Re. are conditionally periodic, as may be seen 
from (2.6). If this solution lies in the subspace . but not in any smaller subspace, 
then it lies on an r’-dimensional invariant torus (r’ = r’(%)), and has r’ inde- 
pendent frequencies. In particular, solutions in Re * have r(¥) independent 
frequencies. 

10) Thus, the set Re. consists of invariant tori filled by conditionally 
periodic solutions. As may be seen from property 5, the torus 7° < .of is defined 
by the constant values o?_4,,,..., 5?, E°, which correspond to the value a = a®. 
The frequencies on this torus 7° are 


7) = 0 7 rey 
Qa’, iA, a”, ..., tj ga”. 


The eigenvalues of the torus 7° are the eigenvalues of the rest point, 0° 441, 
..., OP, E° in the subsystem of the last d equations of the system (1.15”), ie., the 
eigenvalues of the matrix 
OGr-a+i S158 ¢ 9) 
O(n By-441,---,1n d,) 


(2.11’) 


evaluated at the indicated rest point. 


2.3. Properties of the Set. re 


Note that the real dimension of the set Re. does not exceed the complex 
dimension of the set ., but may be less. In particular, the number of real 
parameters on the set Re.’ does not exceed the number o’ defined in property 8. 


2.3.4. Adjoining Families of Tori 


A (j + s)-dimensional set is called an s-parameter family of j-dimensional 
irreducible invariant tori of the system (1.12) and is denoted by #; if on this set 
there are independent functions h,, ..., h, (parameters) to each value of which, 
h°, ..., h® there corresponds a j-dimensional invariant torus in F/. The family 
F{ adjoins the torus 7 if the intersection of the family with any neighborhood 
ofthe torus.7 isa family of the same type. In this setting, the family ¥/ is assumed 
to be either analytic or formal corresponding to the respective properties of the 
(j + s)-dimensional set and of the parameters h; in it. 


Theorem 2.4. If the family FJ adjoining the torus JF lies in the subspace H 
but in no smaller subspace and if r' = r'(4) # 0, then j = r' and F3 always lies in 
the set Re s#. Conversely, each connected component S of the set Re & adjoining 
TF forms a family Ff if S lies in H but in no smaller subspace. 


The proof follows from the arguments of property 5 above, extended to an 
arbitrary subspace .#. The restriction of the theorem (i.e., r’ 4 0) is needed only 
for k = 0 on the integral subspace #0], corresponding to the collection of all 
zero eigenvalues: 

ApS =A, =0 (4; #0,j >U') 


according to (1.15””). On the subspace # [0] the set of 7 [0] consists of rest 
points (see property 6). However, in # [0] there may be families of periodic and 
conditionally periodic solutions formally adjoining 7. These families don’t lie 
in the set / and they may not be analytic families of the original system (1.7). 


Example 2.3. Consider the system 


4yo(yt + 3) 
—4yi(yt + y3) (2.12) 


V3 = V3 + (V1, 92) 


Yi 


Hl 


Ya 


where k =0,/=2,m=1,n =0,4, = 4, =0, pm, = 1. The subsystem for y, and 
yz has the integral h = (y? + y3)’, and its solutions are periodic: 


y= “/hcos4,/ht , 2 = Yasina She . (2.13) 


The seminormalizing transformation of system (2.12) coincides in this case with 
the normalizing transformation and may be written as 


yp =, V2 =02, yz =wet (01,02) . (2.14) 
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The seminormal form of (2.12) coincides with the normal form and is 
&, = 4v,(v? + v3) 
by = —4v, (vz + 03) (2.15) 
w=w. 


The manifold ¥ = {v,, v2, w: w = 0} of (2.15) is filled with the periodic solutions 
(2.13), and for it r’ = r(¥’) = 0. The set .o is defined by the system 


4v,(v? + v3) = —4v, (v7 + v3) =0. 


For real v, and v, this system has only the trivial solution v, = v, = 0. Hence, 
Re W = {v, =v, = w = 0}. But on the manifold Re ¥ all solutions of (2.15) are 
periodic and form a one-parameter family with parameter h. The formal family 
F;} lies in ¥ = XH and r'(#) = 0. Consequently, not all formal families of 
periodic solutions adjoining the point v, = v, = w = 0 belong to the set .o. 

On the other hand according to Lemma 7 of §7 [Bruno, 1971; see also 
example 1 of §6] it is possible to choose a convergent series @(y;, yz) in (2.12) 
so that the series é diverges in the normalizing transformation (2.14). That is, the 
formal family Re ¥ of periodic solutions is not analytic. Thus, not all formal 
families of periodic solutions adjoining a fixed point are analytic. 

The following is given in Bruno’s paper [Bruno, 1970]. 


Hypothesis 1: All formal families of periodic solutions adjoining a fixed or 
periodic solution 7 are analytic. 


The example considered shows that for a fixed solution 7 this hypothesis is 
false. It will be shown in § 4 that the hypothesis is true for all formal families .F,' 
located outside the subspace ¥ [0] for k = 0(see theorem 2.4), and the hypothesis 
is always true for k = 1. 


Example 2.4. Consider system (1.7) with k = 0, | = 2, m_ arbitrary, m, = 0, 
n= 1, where m,A, + m,A, = 0 for certain positive integral values of m, and m). 
That is, system (1.7) passes through a single positive resonance. Normal form 
(1.12) is 


B= vy DY gpale\(vr os?) = vg (oT vee), f= 1,2, (2.15') 


k=0 
where g,,(¢) are power series in «. If we set } = v7 p32, then 
b= b(myg, + m2g2) = if(i,e) . 
The set .f is defined by the system of equations 


vg; = Ajvja , ie eae 


2.4. Properties of the Set P= By, 


consists of three components 
A = {Vie:g, =A,a,0, =0} = FLA], 
wf? ={V,e:v, = 0,9, =A,a} = D2], 
St ={V,0:9, =4,a,g, =4,a) ={f =0}. 


The components .' and .o? are invariant coordinate subspaces. Then for .«/3 
we have the equation 
f =m g,(i,e) + m.g,(8,82 =0. 
If 
of/ie #O for b=e=0, (2.16) 


then this equation has the unique solution e = é(vj"' v3). As long as m, = 0, the 
set Wy = _ is invariant. Condition a is satisfied in this case and, according to 
theorem 2.2, the set o, is analytic. In particular the invariant component 
3, = of} is analytic and, if (2.16) is satisfied, is also an analytic manifold. This 
result appears as theorem 2 in Pyartli’s paper [Pyartli, 1972]. In that paper the 
points 4, and /, lie on a line passing through the origin, while the remaining 
eigenvalues yu; lie on one side of the line. By introducing a new time variable 
t = tt, wher t is a suitable complex number, the line containing 4, and 4, can 
be made to coincide with the imaginary axis and Pyartli’s case is reduced to that 
considered here. 

If the original system is real and 2, and A, are imaginary, then J, = —A, and 
the reality relation is v, = v,. Therefore = v, v2 is real and 0 > 0 for ail real 
values of the original variables. The set Re.o/° is a one-parameter family of 
periodic solutions. If condition (2.16) is imposed, this family may be written in 
the form 

e=é&0) =cyo+c,0? +°° 


The values of ¢ in this expression have a fixed sign. As t > 00 the solutions in the 
set Re.o/? tend asymptotically to the family Re 7°. Here we have that 


Re! = Rev? = {Vie:0, =v, = 0}. 
2.4. Properties of the Set # = By nv 


2.4.1. The Set @* =Bayv* 


1) First we consider the matrix B on the set * = 7 1 ¥*. Since on it we 


have 

ow 09; 

fa =(v,—t Lp sBeaghts 
av 74 (0) es 


J 
then 
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Op Ob 
oU ov 
B= =DB,D" , 
09; iy 09g; 
v;—— _——* 
“Ou, dv; /) 
where D is the diagonal matrix {1,..., 1,v,,.-.,v,} and 
ob 6@ \ 
cU oélnV 
B, = 
6G = «#60G 
6U élnV 


If for system (1.15”) we take an analogous matrix, 


Ob O@ 
= oU éinV 
B, = = 

6G §=6aG 

oU élnV 


then according to (1.15’) and (1.15’”) it is similar to a matrix B,: 


B, = AB, A" , 
where 
E 0) 
A= ; 
. iy K) 


Thus, the matrices B, and B, are similar on the set ./*. Hence, the set B* = 
B 7 of* is distinguished from the set «/* in that on * the matrix B, is nilpotent. 
Since © and G do not depend on U, the matrix B, is nilpotent simultaneously 
with the matrix 0G/A(In V). Moreover, G is independent of d,,..., 3,4. Therefore 
the set A* is distinguished by the nilpotency of the matrix (2.11'); 1e., on @* the 
eigenvalues of the matrix (2.11’) all vanish (all the coefficients of the characteristic 
polynomial! of the matrix are zero). Hence the set 4* is characterized within ./* 
by d equations, and the number of free parameters on the set #* is 


Op=dtn+t1—-(k+l—d=n+1-(k4+l). 


It is assumed throughout that d # k + 1. 

2) According to property 9 of section 2.3, solutions in the set Re ./* are 
conditionally periodic, while their characteristic numbers are the eigenvalues of 
the matrix (2.11') (see property 10 of section 2.3). Hence, the only tori in Re #* 
are those in Re .#* for which all eigenvalues vanish. The set 4* may be described 
as follows: it consists of those stationary solutions of the subsystem of the last d 


2.4. Properties of the Set #=By avs 


equations of (1.15”) for which the functions ®, 9,,..., 9,_, take on the values Qa, 
4,4, ..., 4,ga, and for which all eigenvalues of the subsystem vanish. The 
invariance of the set &* in system (1.12) follows from all this. 


Example 2.5. Consider the following system 


xy =o + éf(X,8) > 
(2.17) 
ye ee 


on a two-dimensional torus, where ¢ is a small parameter. Here k = 2,/ = m = 0, 
n = 1, while the initial torus 7 has, for ¢ = 0, 2 = (a, 1), where w is an irrational 
nisiibet Condition (1.5) is satisfied and the normal form is 


U = @&(s) = 


The matrix B is everywhere nilpotent since ® does not depend on U. The set 
Jf = ® is defined by the system @(e) = Qa, which reduces to an equation 
~p,(e) = we,(é) after dividing by a. If this equation is not identically satisfied, then 
near € = 0 the only solution is ¢ = 0. 

We introduce another small parameter 6 into system (2.17): 


X, =o+d+ ef(X,¢) , Koh (2.18) 
The corresponding normal form is 


a, =at+6 + e9{Me,d) + 6’ p?(6) , 
(2.19) 
iy = 1 + eg}"(e,5) + 6? g(6) . 


As before, the matrix B is everywhere nilpotent. The equation for the set « has 
the following form 


3 + 6pj(6,5) + 57 @\?(5) = leg}? + 5? 9p) . 


Near the point ¢ = 6 =0 it has the unique solution 6 = 5(e). According to 
theorem 2.3 if 2 satisfies condition f, then the transformation of system (2.18) to 
normal form (2.19) is analytic for 6 = 5(e), which is itself an analytic function. 
That is, for sufficiently small ¢ there exists 6 = d(e) such that system (2.18) is 
reducible to the form 

u, =a, u,=a 


by means of an analytic transformation. In fact, this is a form of theorem 2 in 
§13 of Arnold’s paper [Arnold, 1961]. In order to reduce the system 


X =Q +4 eZ(X,e) 
to the form 
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U=2 (2.19’) 


for arbitrary k > 2 it is necessary to add a k-dimensional parameter 4 to the 
right hand part of the system. Then the normal form is 


U=Q244 + O(e) + O(Al?) = Ge, A) . 


The part of the set «/ = # on which the parameter a is | is defined by the 


system 
@-Q=A+ Ole) + O(\A?)=0. 


According to the Implicit Function Theorem this system has the unique solution 
A=A(e), 4(0) = 0. According to theorem 2.3, if Q satisfies condition f and if ¢ 
is sufficiently small, then there is a function 4(e) such that the system 


X =Q+ Ale) + &Z(X, 8) 


may be reduced to the form (2.19’) by an analytic change of coordinates. This is 
theorem 9 in Arnold’s paper [Arnold, 1961]. There is a misprint in the formula- 
tion of Arnold’s theorem: du/dt = y should appear in the last formula, not 
du/dt = 2nu. 


Example 2.6. We shall now make system (2.17) more complicated by adding 
a third coordinate: 


X, = oO+ yy + A(X y,2) , 
xe (2.20) 
y = eyfi(X, y,8) 
Here k = 2,! = 1,m =0,n = 1; w is an irrational number, and the initial torus 
is obtained for y = € = 0, and 4, = 0. The normal form is 
ti, = © + yo + 80,(2,8) , 


u, = 1 + e@p(v,8) , 


b= evp(v, €) . 
We assume that 
W(v,e) =0 ; (2.21) 


then the matrix B is everywhere nilpotent. The set .o = @ is defined by the 
equation 
yo + Ep, = WED, , 


which for y #0 has a unique solution v = o(e) (near v = e = 0). If Q =(a,1) 
satisfies condition f and if (2.21) is also satisfied, then according to theorem 2.3 
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the function 6(e) is analytic, and for each sufficiently small ¢ system (2.20) has an 
invariant torus with a fixed frequency ratio @,(é)/w,(e) = @. If instead of a 
neighborhood of the torus 7 we consider the point mapping defined on the 
section x, = 0 by the solutions of the system, then we obtain an analytic variant 
of a theorem of Moser [Moser, 1962; 1973a, part 1, theorem 9}. Identity (2.21) 
holds either if the mapping preserves area, or if any loop intersects its image, or 
if the mapping is invertible, or if the divergence vanishes [Moser, 1966, § 10, 
example 2]. Here v plays essentially the same role as 6 in the previous example 
even though the system is no longer defined on the torus, but in a neighborhood 
of the torus. 


Example 2.7. In system (1.7) k and m_ are arbitrary,! =m, =0,n =k + 1, 
and Z") = 4 + 6@(X, Y,e), where 4 = (6,,...,6,) and ¢ are small parameters. 
Then in the normal form (1.12) 


P= 2444 eG (A, 8) , 
and the set .° is defined by the system 
4+ e@(4, 2) = Q(a— 1). 
which has the unique solution 
4A=A(e,a—1), 4(0,0)=0. 


That is, the set . is a manifold. Since / = 0 and @ does not depend on U, the 
matrix B is identically zero; consequently, ../ = &. According to theorems 2.3 
and 2.4, if Q satisfies condition f, then the set Re.«/ is an analytic two-parameter 
family of k-dimensional tori. In the family there is a one-parameter family F< of 
tori with frequencies Q. This family is found by solving the system 4 + e@") = 0 
with unique solution 4 = A(e,0). Since m, = 0, the set ./y = .o_ is invariant. 
Solutions in Re.«_ tend asymptotically to tori in Re. as t > +00. The subset 
Re(¥*)y is also an invariant analytic manifold consisting of a one-parameter 
family of tori with frequencies 2 and of solutions asymptotic to these tori. The 
analyticity of this subset is proved in theorems 3 and 4 in Bogolyubov’s lectures 
[Bogolyubov, 1964; see also Mitropol’skii, 1964, theorem 1]. 


Example 2.8. Let us consider the invertible system 


X = Q + FOLY) + eFH(X, Ye) , 
(2.22) 
Y = «G(X, Ye) , 
where 


FO(— xX, Ye) = FOX, Y, €) ’ 
G(—X, Ye) = —G(Xx, Y, €) > 


FOO) = 0 > Y =(¥1,---5 Yi) > 
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and ¢ is a small parameter. The initial torus is 7 = {X, Y,e: Y =0,¢ = 0}; A =0, 
and Q is assumed to satisfy condition (1.5). The normal form is 


U = 24+ GOV) + OV, 6) , 
(2.23) 
V=ce¥%(V8) . 


It may be shown that ®(V) = FV) since system (2.22) is in normal form for 
e = 0 [Bruno, 1973, corollary of material in section 2.2]. According to property 
5 of section 1.4 the normal form is invertible and 


P(V, 2) = OV, 8) , 
P(V, 2) = —P(V,8) . 


That is, ¥(V,2) = 0, while (V,¢) is an arbitrary series. The matrix B is every- 
where nilpotent. The set . = @ is determined by the equations 


Q + FOV) + eB (V, 6) = Qa 
or 
FOV) + e®O(V, 2) = Q(a— 1) 


If for V =0 the Jacobian det(@F/@V) # 0, then this system has the unique 
solution 


V=V(e,a—1), V(0,0) = 0 (2.24) 


depending on two free parameters. If Q satisfies condition f, then according to 
theorem 2.3 the family (2.24) is analytic. Here the situation is much as in example 
2.7: for a = 1, the family (2.24) has a one-parameter family of tori with frequencies 
2. That is, the initial torus 7 can be continued in the parameter ¢ [Moser, 1967, 
§ 6, section b]. 


2.4.2. The Set Z on Subspaces 


3) Let us consider the set @ on the coordinate subspace (2.8’). On the set 
A = AH the matrix Bis 


'AB/aU ed/ev' a@/av" 
B=|d¥'/0U 6¥'/aV'—L'a ow /av’ (2.25) 
av" sau ew" sav’ aw’ ev" — La} 


Here L’ = {A,,...,d,}, and L” = {A,4,...,4,} are diagonal matrices; the last 
column is calculated for V” = 0, and in that column there appear the parts of 
the series ® and ¥ which are linear in V”. The set @” = # AZ is singled out of 
the set .¢’ by the condition B**' = 0. 


2.4. Properties of the Set # = By, 


4) Ifthe subspace % is an integral subspace Y = #[A,,...,4,], then #” = 0. 
Hence, the matrix (2.25) is block triangular. It is nilpotent if and only if both of 
its diagonal blocks 


/ ad ad 
aU eV’ 
B=| : (2.26) 
OW” te 
SAEs ce — L'a | 
\eU av’ / 
pra oy (2.27) 
Sep ne 


are nilpotent. We shall take the parts of ¥” which are linear in V": 
I 
w= ¥ ¢fU,V' Eto, i=l4+1,...,1, 

jet 

and form the matrix C” = (c;;),i,j =1' + 1,...,. Then B” = C” — L"a. Note that 
in the integral subspace Y = ¥[A,,...,4,] the normal form (1.12) induces the 


system 
U = BU,V’, E) = Bly-co 
, (2.28) 
V' = P(U,V E) = P'lyrco 


If B’ denotes the set in Y which is the set # for system (2.28), then 
B ={U,V',E: U,V’, EB 6 J’, Bet = 0}. 


But the set @” = Bo Y is that subset of Z’ on which the matrix B” is nilpotent. 
Thus, 
Lot DB2RR’. 


5) Now consider the sets ./’, #’, and 2” outside small coordinate subspaces: 
Then these are the sets 
A'*= AN L*, B*=BaLt*, B*=BrAL*. 


sof’* and Z’* have the same meaning in #* as ./* and #* have in ¥*. The 
number of parameters on the set #’* is 


dt+n4+1—-(k4+l)-ds=n4+1—-(k+!l'). 


The set #’* is further defined by the | — /' equations which are obtained by 
equating to zero all coefficients of the characteristic polynomial of the matrix B”. 
But all coefficients y” of the characteristic polynomial of the matrix B” are series 
in V’ and expi<P, U>. These series contain only those terms 


V2 EX expi<P,U> , 
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for which 
1¢P,Q> + <Q, A> =0. 


Thus, if analogously to property 1! of section 1.4, we transform coordinates V’ 
to coordinates V’, then the coefficients y depend only on the “resonant coordi- 
nates” dy 441, -.-, 5, E. Hence the condition B”~" = 0 is a restriction only on 
the choice of fixed points in the coordinates 0,_44,,..., 0), £. Therefore, the set 
"* is invariant in system (2.28). The number of free parameters in #”* is 


opa=nt1—-(kK4+l)—-(-lMan41-(k 4+). 


It should be noted that the number of parameters in each component of the 
set # is unique, while the number of parameters in the components of the set 
may diminish with the expansion of the subspace. 

6) For some i < J, let 


4, # DY, qj +i P,Q) , (2.29) 
j#i 


for all integers q; > 0, where P € Z*. Then in the normal form (1.12) the series y; 
does not contain terms independent of »,, ie., W; = v;g;, where g; is a series in 
non-negative powers of V. The normal form (1.12) has the integral manifold 


L[A] = {U,V,E: v, = 0}. 
The set . splits into two components, 
A[i] = {U,V E: ® = Qa; by, = dv,a,j # i,v; = 0} , 
(i) = {U,V E: ® = Qa; Wy = 4,v;0,j # ig; = Aja} . 
Evidently, o = [i] U V/(i). Now we show that 
Ba Ai). (2.30) 


This need only be shown for points in #[/,], that is for points in [i]. For this 
the matrix B” is the scalar 
B" =g;—4,a . 
It is nilpotent only if 
g =4,a . 


But this is the condition which distinguishes the set (i) \ Y from the set .o [i]. 
This verifies (2.30). 
If inequalities (2.29) are satisfied for some i, say fori = 1,2,...,l', where l’ <1, 
then 
Bo AYOAAQQA OAT) = A(L,,2,...,1) , (2.31) 
where 


A(1,...,1)={U,V, E: ® = Qa; g,=4,4,i=1,...,1'0 b= Aan, j=l + 1,...,0} . 


§3. A Hamiltonian System 


3.1. Statement of the Problem 


We shall consider the case where system (1.1) is a Hamiltonian system with 
s degrees of freedom: 
ta) a 
u Z ae a Caer ae (3.1) 


to Ze4i 
i 0254; > STI az. > 


where f = f(Z,E), Z = (2;,...,Z25), Le, k + 1 + m = 2s. Let system (3.1) have a 
k-dimensional initial torus 7. Then it is possible to introduce local canonical 
coordinates X, Y so that system (1.7) is the Hamiltonian system 


Xj er) iS Taso = 1, Jk > 
© OY; Ox; 
(3.2) 
of of 
rape =— =1,...,.s—k 
Ve+j BYsaj > Vs+j Yes; ’ J : Ss ; 
where f = f(X, Y, E). If Y = 0 and E = 0, then 
of of of of 
—_ = WO; ’ —_= => = 0 . 
y; 6x; OVn+j OVs4j 


Moreover, for E = 0 the constant, linear, and quadratic terms (with respect to 
Y) of the series for f do not depend on X. The matrix A of the linear part of 
system (3.2) has k zero eigenvalues 


ApS Hd =0; (3.3) 
moreover, / pairs lie on the imaginary axis 
dings hateps.. “RARER. Fedak (3.4) 


and there are m_ = m, pairs of complex eigenvalues 
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Hy = —Um_+j > Rey, <0, j=l1,...,m- (3.5) 
Thus, . 
l=k+2I1, m=2m_=2m, , 


Wk+eltm_)=kt+lim=2s. 


First let us consider the case m = 0 when ail eigenvalues have zero real parts 
and k +1=~s. Then the normalizing transformation can be assumed to be 
canonical and the normal form (1.12) is the Hamiltonian system 


oh Oh 
= = —- ES 1k, 
4 6v; ; Gu; ’ te 
(3.6) 
oh oh ; i 
a e i See eS = 1, 4 xu 
ktj B0ysi4j k+T+j Ove4j J 
where 
h = ¥ hpgs VPES expi<P, UD (3.7) 


and the summation is carried out only for those Pe Z*,0< QeEZ,0<SeEZ" 
which satisfy equation (1.15) [Bruno, 1971, theorem 12 in § 12; 1970, theorem 1]. 
Thanks to property (3.3), the variables v,,..., v, and the parameter F appear in 
the series expansion of (3.7) with non-negative integral exponents. Equation (1.15) 
only imposes bounds on the exponents of the variables 0,41, ..., U+2i- 


Example 3.1. Let k = 1 = s,m = 0. Then A = Oand, thanks to condition (1.5), 
the solutions of equations (1.15) are P = 0, Q arbitrary. Thus, in normal form 
(3.6), the Hamiltonian (3.7) depends only on V = (v,,...,0,). According to (3.6), 
we have in the notation of (1.12) that 


A Ah 
Me ac Ft HE a fh: 


d = — = > = 
OV 6U 


The equations for the set of are 
= =a. (3.8) 
The matrix B is always nilpotent since 
B= ( hyy hyy ) = (° a 
—hyy —hyy 0 0 


Let n=0 and h=(O,V) +4<¢V,CV)> + O(|V|3), where C is a symmetric 
matrix. Then @h/OV = Q + CV +-:--, and equation (3.8) has the form 


CV =Qa—1)+-- 


3.1, Statement of the Problem 


If 
detC #0, (3.9) 


then according to the Implicit Function Theorem this equation has the unique 


solution 
V = V(a — 1) = C'Q(a — 1) + O((a — 19?) , 


where a — 1 may be considered to be a small parameter. In the general case where 
condition f and (3.9) are satisfied, then according to theorems 2.3 and 2.4 the 
k-dimensional torus 7 is located in an analytic one-parameter family of k- 
dimensional tori; their frequencies Qa are proportional to the set of frequencies 
Q of the initial torus. 


Example 3.2. Now suppose that in the previous example there is a small 
parameter ¢ (i.e., suppose that n = 1): 


h=<{Q,V> + 3<V,CV) + e€I,V)> + O((|V| + Jel)°), = const ; 


oh 
then ay = 2+CV + eff +°+:. Consider equation (3.8) for a = 1: 
C 


Q4+CV+ell+:=2, 
i.€., 
CV +el7+-°=0. 


Condition (3.9) implies that this equation has a unique solution 
V = V(e) = —eC TT + O(c?) . 


It follows from theorems 2.3 and 2.4 that, where conditions # and (3.9) are 
imposed on the Hamiltonian system with k degrees of freedom and one small 
parameter, the k-dimensional torus 7 lies in an analytic one-parameter family 
of invariant tori with the same set of frequencies 2. This result is theorem ! ina 
paper of Kolmogorov [Kolmogorov, 1954]. Conditions (3) and (4) given in that 
paper guarantee that our conditions f and (3.9) are met. In order to obtain the 
coordinate P used in Kolmogorov’s paper, it is necessary to set P = V — V(e); 
then the family is defined by P = 0. For k = 2 the corresponding theorem was 
proved by Barrar[Barrar, 1966], but its formulation is needlessly complicated. 
Indeed here there is a two-parameter (¢ and a) family of k-dimensional tori with 
frequencies Qa, and in this family there is a one-parameter family with fre- 
quencies Q2. Actually, system (3.8) for arbitrary a is 


CV+ef+-°°=Q(a-1). (3.10) 
It has a unique two-dimensional solution if the k x (k + 1) matrix 


(CQ) (3.11) 
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has at least one minor of dimension k which is not zero. We shall now consider 
the constant-energy continuation of the initial torus 7, that is, the continuation 
along the level sets of constant energy, h = const. To solve this problem it is 
necessary to add to system (3.10) the condition 


h=WV,Q4-"=0. 


All systems have a unique one-dimensional solution V = V(e), a = A(e), if 


C Q 
aet(¢ 0) #0. 


ie, ifat V=Oande=0 


hyy hy 
det : 
e i wr) x0 (3.12) 


The last condition was suggested by Arnold [Arnold, 1963, footnote on page 19], 
instead of Kolmogorov’s condition (3.9). In fact, it is enough to require that the 
matrix (3.11) be nonsingular, This condition is weaker than (3.9) and (3.12). 

If m > 0, then by the formal canonical change of coordinates, system (3.2) is 
carried into the Hamiltonian seminormal form: 


af 0 
ga ashe. Sm [Ny errg we: 
Cv; Gu; 
of of 
7. eS ’ Owing = — : ’ = 1, il ’ 3.13 
k+j Ogi, kt+itj Be, J ( ) 
of of 
Ww, = : 7 Vin ,=-—, = 1, 'z 
OWn 4 omen Ow; ; 7 


where the Hamiltonian 
f =h(U,V,E) + W(U,V,W,E) + f(U, V,W,E) (3.14) 


has the following properties: 

1. The h term in the series for f does not depend on W, and the expansion 
(3.7) for h has the same properties as the normal form (3.6). 

2. The term h contains all the terms in f which are linear in at least one of 
the two groups of coordinates, 


W_ = (W,,...,Wm_) OF Wy = (Wa ata--+2Wam_) - 
The expansion 
h=¥ MpgrsVOW*® ES expi<P, UD 


contains only those terms for which <R, Re M)> =0. 
3. f contains all terms of f whose degrees in W_ or W, are greater than 1. 


3.2. The Non-resonant Case 


It is not hard to show that, for the Hamiltonian (3.14), system (3.13) is a 
seminormal form in the sense of section 1.5. The system has formal integral 
manifolds 7, W_, W,, W;, but the coordinates W do not affect the sets xy and 
By which are defined only by the term h in (3.14). 


Example 3.3. Let / = 0, k and m arbitrary. According to examples 3.1 and 3.2, 
the real part of the set ./, = By contains an (n + 1)-parameter family, Re», 
of k-dimensional tori. Together with each torus 7° < Re.w, the set Re Hy, also 
contains the s-dimensional “tendrils” 7° and 7°, which tend asymptotically to 
the torus 7° and lie in the invariant sets Re.o/_ and Re.o/,. If Kolmogorov’s 
condition (3.9) is satisfied and if ¢ is a single small parameter, then there is a 
one-parameter real family of “tendrilled” tori with the same frequencies Q as the 
initial torus. According to theorem 2.3, condition f implies the family is analytic 
for this system. This result is the basic theorem in Graff’s dissertation [Graff, 
1971, 1974; see also Moser, 1973b, theorem 3]. Theorem 3 in Bibikov’s paper 
[Bibikov, 1973a] asserts only the existence of k-dimensional tori with frequencies 
Q and does not consider the “tendrils”. A particular case of this theorem appears 
in a paper of Moser [Moser, 1967, theorem 7], where there is an additional 
assumption that every y; is real and the matrix A is diagonable. 

Thus, for a Hamiltonian system with s degrees of freedom, a torus of arbitrary 
dimension k is structurally stable if k of its eigenvalues vanish and the remaining 
do not lie on the imaginary axis. Moreover in a system free of parameters these 
tori form a one-parameter family. 

There are similar results for invertible systems; see example 2.8 and Bibikov’s 
work [Bibikov, 1970a]. 


3.2. The Non-resonant Case 


Let 
1¢P,Q2> + deride, $+ Get tAnsi FO (3.15) 


for all nonzero integral P and q;. Thanks to (3.3) and (3.4) all solutions P, Q of 
equation (1.15) are such that P = 0, q,,..., q, are arbitrary, q,4; = 447+; for 
j=l,..., 1 Set 


Paty = Un+jUeri+g » i= Rcsegh ' (3.15') 
If condition (3.15) is imposed, then the Hamiltonian (3.7) depends only on v,,..., 
Ves Petts +++> Pray Therefore, the normal form (3.6) has the form 
0 
U; ate: b; = Gh oy, , PSM ci Gikss, 
Ov; ou 
(3.16) 
Oh oh 


ca 
Il 

= 
bass « 


Dy jp = eae yTT > Onsi+g = = ’ 
ktj +4 
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According to (3.15) inequality (2.29) is satisfied for alli = k + 1,...,k + 21 Thus, 
according to (2.31), 


Bc Ak +1,...,.k +20 
(3.17) 
oh fe 
U, Vee wai=1,...,k;——=Aajek+.,..,k+0). 
00; Op; 


] 


Since the Hamiltonian h does not depend on U, then, if system (3.16) is written 
in the form (1.12), we have ¥, = +: = w, = 0,06/0U = Oand d¥/aU = 0. Hence 
the matrix (2.4) is nilpotent together with the matrix 


Bore (H- sana) 2 ij =k+t I, ee. k+ al : 
J 


where 6,; is the Kronecker symbol. According to (3.16) and (3.17) the matrix Bo 
on the set #(k + 1,...,k + 2/1) is the block matrix 


D,;HD, D,HD 
Be ate (3.18) 
~D,HD, ~—D,HD, 


) = 
= ’ Rjok+ti.i kel, 
(nan : 


where 


Di = {Ope 15 +05 Pst} , D,= {Upstets +++ Uneaty ’ 


(D, and D, are diagonal matrices). It is not hard to see that the square of matrix 
(3.18) is the zero matrix since D, D, = D,D,. Hence, according to (3.17), 


B= bk+i,...,k +21. (3.19) 
First let us consider the properties of the complex set 4. The number of 
variables v,,...5 Ug Patio <--> Pert E, aisk +1+n+ 1. According to (3.17), the 


number of equations defining the set (3.19) is k + | Thus, the number of free 
parameters in the set (3.19) is 


k+]ltnti-(k+Deand41. 


Now let 
p; =U; , C= 1, ,k ’ 
Pury = ies; » ja,...,0; 
. (3.20) 
P= (Bis -+ +5 Bes Betio++> Peri) 3 
D = (45006. Des gagye-estdgaz) - 
Then 


h = h(E) + (P,Q> + 3<P,CP> + <P, TIE) + O((|P| + 1EI)) , (3.21) 
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where C is a square symmetric matrix of size k + L HTis a rectangular matrix of 
size (k + 1) x n. According to (3.17) the set (3.19) is defined by the system of 
equations 

Oh/OP = Qa , (3.22) 


which according to (3.21) has the form 


O+CP+ME+::=Qa, 
i.e., 
CP + HE +: =Q(a—1). (3.23) 


We shall find conditions under which the solutions of system (3.23) form a 
manifold in the coordinates P and E, ie., system (3.23) can be solved for the 
parameter a — 1 and for k + /— 1 of the coordinates in P, E. The following 
suffices: 


rank(C®) =k +1. (3.24) 


In fact, as long as 2 # 0, in the presence of condition (3.24), the matrix (C/7®) 
always has a minor of size k + I which is not zero and contains the column 9. 
Using this minor in the Implicit Function Theorem, we obtain the required 
solution of system (3.23). According to (3.15’) and (3.20) the coordinates p,41,..., 
Py. are expressed nonlinearly in local coordinates V. Therefore, the set (3.19) is 
a manifold in the local coordinates V if the system of equations (3.23) can be 
solved for a — 1 and for k + / — 1 of the variables Py, ..+5 Py E, in such a way 
that the coordinates 6,4, ..-, 6+; are free parameters. A sufficient condition for 
this to occur is that 

rank(C/78) =k +1, (3.25) 


where the size of the matrix C is (k + 1) x k; C consists of the first k columns of 
C. This is only possible if ! <n + 1. Finally, the set (3.19) is a manifold in the 
coordinates V for all small values of the parameters F if 


rank(C®)=k+i, (3.26) 


which is possible only for 1 < 1. 
Now let us consider the properties of the real part of the set (3.19). The 
corresponding reality relations for V are 


= 0; , Dalek 3 
Upeg = “Wyerieg > j=1,....1. 


See, for example, Siegel’s work [Siegel, 1956, the end of § 13]. Hence, v; is any 
real number, and 


1 
Sane Vee) NS + Ag Uys; > 
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i.e., according to (3.15’) 
™ 
arg Py+j = ATE Vy 4; + ALY +745 = > 


and 
Puri = — Peri - 


This means that p,,; has pure imaginary values and Im p,,, > 0. Thus, according 
to (3.20) the reality relations for P are 


ImP=0, far 29,..., Peg BO” (3.27) 


Note that, for the solution of the normal form (3.16), P = P° = const. The torus 
Fc ReB has k + j frequencies if ! — j of the numbers pp,,,..., Oe4; are zero. 
The vector © and the matrix C in (3.21) are real if the Hamiltonian h is real. 

Now let us consider the situation for several different values of |. The case 
] = 0 was taken up in examples 3.1, 3.2, and 3.3. 


i=1. 1, Suppose (3.26) is satisfied, i.e., 
det(CQ) #0. (3.28) 
Then system (3.23) has the unique solution 
G=ApE), Pr =Plbev£), i= 1,..,k. (3.29) 


Restrictions (3.27) are reduced to the single restriction, p,., > 0. If 6,4, = 0 then 
(3.29) gives k-dimensional tori in the set Re&, while for 6,,., > 0 there are 
(k + 1)-dimensional tori in the set Re@. Thus, the set Re# consists of two 
families F,* and F,*;\', where Fy is the boundary of 7,4’. 

If n = 0, then the family %* = F< consists solely of the initial torus 7, while 
the family ¥,**4' = F**" is the one-parameter family of (k + 1)- dimensional tori 
emanating from the initial torus 7. That is, for a system in general position the 
k-dimensional torus with a single pair of pure imaginary eigenvalues is the 
boundary of an analytic family of (k + 1)-dimensional tori all of whose eigen- 
values have nonzero real parts. 

If n=1, then AX = Fk and FX! = ¥#*". For each value of the single 
parameter ¢ there is a family of (k + 1)-dimensional tori terminating at a k- 
dimensional torus. In figure 1 there is a representation of the projection of the 
set @ on the plane 6,, 6,4, for various values of s. The heavy lines denote the real 
parts of the projections of the set Re #, while the marked points correspond 
to k-dimensional tori. Thus, if the initial torus Z has only one pair of pure 
imaginary eigenvalues, and if conditions 8, (3.15) and (3.28) hold, then, according 
to theorem 2.3, near the torus 7 there is an analytic family ge bounded by 
the analytic family F*. The existence of the family A under the additional 
condition m_ = 0 was shown by Moser [Moser, 1967, theorem 6]. Moser’s 
condition that the Jacobian of the vector (6.15) for V = 0 not be zero is equivalent 
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Fig. 1 


to our condition (3.28). In fact Moser’s vector (6.15) is made up of the components 


a; _1¢h 


0 22% 2k. (3.30) 
a “00; 


where a = ih =i), +--+. The Jacobian of the vector (3.30) is 


D+ 1PK+2 


det (2 (*) | i, = det (Stee = rhea, 
a OE SS ow, 
(3.31) 
h, 


1 h 
= ar oet(, i Se ht \i ij=lsk. 
ae We ses ites the voces 


Now note that for V = 0 the last matrix is the matrix (CQ) of (3.28). On the other 
1 


A,,. Thus, the second 
24 ? 


1 
hand if we set 4 =’, then 4,, = 2aa,, and —a, = 
a J 


determinant in (3.31) can be written in the form 


1 11 1 ig *h 
det { —h,,, -—_——h,.4,)=~—det| °% |.” ij=1,...,k . (3.32 
‘ (: We A ) Auk °° be ms) bd 2) 


As long as « = id, #0 for V =0, then the determinants in (3.28), (3.31) and 
(3.32) vanish simultaneously. Note that Moser’s condition (6.16) has a mis- 
print: the factor 2 in front of 4 in (3.32) is missing. Moser used some weaker 
arithmetical conditions on Q and A than f and (3.15) [Moser, 1967, theorem 6] 
since he was looking only for the family ¥; and not for the family ¥*!. In §5 
it is shown how the arithmetical conditions may be weakened for families located 
in coordinate subspaces (in the given case, on the subspace v,,, = 0,42 = 0). 
i = 2. Restrictions for coordinates (3.27) are given by 


Perr 20, Perr 20. (3.33) 
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First we consider the case n = 0. Since / = 2>n+ 1 = 1, then condition (3.25) 
is not satisfied. Supposed (3.24) is satisfied, where to be specific we assume that 
the matrix (QC) has its extreme left-hand minor different from zero. Then the 
solution of system (3.23) has the form 


A = A(y42) = 1+ YoPer2 +77" 
; (3.34) 
Pi = PilPe+2) = ViPer2 to, i=1,..,k4+1. 


We shall consider two fundamental cases: 

1) +1 <0; then 6,,, and 6,,, in the solution (3.34) have different signs; 

2) ye+1 > 0; then 6,4, and 6,,, in the solution (3.34) have the same signs. 

In the first case the set Re # consists of a single initial torus 7 since only on 
ZF are both inequalities (3.33) for the solutions (3.34) satisfied. In the second case 
the set # contains the component ¥*? with p,,.,; > 0 and p,,, > 0. The initial 
k-dimensional torus 7 bounds the family 7;*?. 

If n = 1, then for fixed ¢ the set #@ has a single parameter. As before, let 
rank(@C) = k + I. If for e = 0 we have case 1, then the construction of the set 
Re @ depends upon the sign of ¢. For one sign (say, ¢e < 0) the set Re@ is 
empty; for the other sign (¢ > 0) the set Re@ consists of the family F#**? 
bounded by two (k + 1)-dimensional tori. See figure 2 for a representation of the 
projection of Z with ¢ fixed on the plane 6,1, 6,42. The heavy lines denote the 
real part of B (the family #**?) while the marked points denote the k and 
(k + 1)-dimensional tori. If for ¢ = 0 we have case 2, then for any fixed ¢ 4 0 we 
have a family ¥**? bounded by a single (k + 1)-dimensional torus (see figure 3). 
In both cases the (k + 1)-dimensional tori form two one-parameter families (with 
parameter ¢) merging with and vanishing into the initial torus 7. 


i > 2. We shall consider only the case where n + 1 > / and (3.25) holds. To 
be specific, we assume that the left minor of the matrix (QC//) is different from 


Fig. 3 


zero. Then system (3.23) has a unique solution of the form 
A= G(Pyriy +++ Paris Ebr e-+s En) 
Bi = PDetisee+> Per Sire),  E=d ik, (3.35) 
&) = E(Dyats +s Petis liso En) 9 j=l,...,J-—1. 


The real part Re @ is defined by conditions (3.27). If among the numbers /,4,, 
.+> Pyat 1 Of them are set equal to 0 (0 <r < /) and the remaining are variable, 
but positive, then the solution (3.35) gives a family .%,4"/-/. For each r we have 


i 
rii—n)! 


oe 
i= 


different families of this kind. For r = 0 we obtain the unique family wast it 
T=n-+ 1 then the unique family with r = Tis Fo which consists of the single 
torus J. Ifi <n, then the family with r = lis F, ik ,-j and contains k-dimensional 
tori for all sufficiently smail ¢;, ..., &,. 


Theorem 3.1. If conditions B, (3.15) and (3.25) are satisfied in system (3.2) with 
T<n+1, then according to theorem 2.3 there are the analytic families F,\'-7, 
r=0,..., 1 which adjoin the initial torus 7. For sufficiently small €;, ..., & and 
arbitrarily small 6 > 0 there are tori with k + 1,...,k+ I distinct frequencies 
in the region |Y| <0, |e,| +-°° + jez_y| < 6. Uf I <n, then there is also a k- 
dimensional torus. 


Krasinskii considered the case n = 7 + 1 [Krasinskii, 1969], the smaliness of 
the disturbance g in the Hamiltonian (2) playing the role of ¢,,, while u,,...,u, 
in Krasinskii’s paper take on the roles of ¢,,..., ¢7, and his conditions (3) and (9) 
guarantee the fulfillment of our conditions f and (3.25) respectively. According 
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to Krasinskii’s basic theorem, for every sufficiently smail ¢;,, there are values 
of the remaining parameters ¢, = é; for which the system has an invariant k- 
dimensional torus with frequencies Q. In fact in this case the family F,*,,_; lying 
in Re is Fs, and for a = 1 it gives a one-parameter family of k-dimensional 
tori. According to Theorem 3.1 in the setting of Krasinskii’s basic theorem there 
exist conditionally periodic solutions with an arbitrary number of frequencies 
fromk tok +1. 


Remark. All the results of this section easily carry over to invertible systems 
in the non-resonant case. In that setting the normal form is 


U=G(P,£), P=0. 


where the variables are 


U =(U, tigar,---s tad > P ’ 


with a4; = are(%+; + iv,47,,;). Here the matrix B, is everywhere nilpotent (see 
the property 1 in the subsection 2.4.1); i.e. the matrix B is nilpotent outside of the 
coordinate subspaces. The set (3.19) is defined by the system @(P, E) = Qa, which 
is analogous to system (3.22). 


3.3. Resonances 


We say that system (3.2) has a resonance of multiplicity k if the equation 
1<P,Q) + qresdiny °° + Qes74na7 = 0 has k linearly independent solutions 
with integer p,, --., Pxys Ge+1> +++» Gri. Thanks to property (1.5), k <i. The 
non-resonant case (k = 0) was analyzed in the previous sections. We shall con- 
sider the case of a resonance of multiplicity 1, k = 1. We shall begin with the case 
f=1. 


i = 1. The simplest case in this setting is when k = 1 and the initial torus 7 
is a periodic solution. Resonance occurs if the number A = 4,,,, /(ia,) is rational: 
A = p/q, where p and q are integers. The case m = n = 0 was examined in detail 
by Bruno [Bruno, 1970, 1972]. It turns out that for |g| > 3 three formal families 
of periodic solutions adjoin the initial torus 7: the basic family # corre- 
sponding to the coordinate subspace, 


Y= [0] = {U, V, Ez bya, = 00 = Ves 27 = 0} 3 


and two additional families, #* consisting of unstable, and F~ consisting 
of stable periodic solutions. All of these families are of type F,', ie., of one- 
parameter type (see figure 4). The solutions of the basic family have period close 
to the period 2x/c, of the initial solution. The solutions of the other two families 
have periods near 22q/w,. These two families form a formal set with an algebraic 
singularity at the initial solution 7. According to theorem 2.4 all of these families 
lie in the set ./, and according to theorem 2.2 these families are analytic. 


Fig. 4 


All of these properties carry over to the general case of arbitrary m and n, 
except that now the families have n + 1 parameters. Finally, for any k > 1 the set 
Re & consists of the same components ¥ < Y[0], F~ and F*; they are all 
families of type ¥,“.,. When the additional families #* and F~ approach the 
basic family ¥, tori of ¥* and F~ tend to q-fold tori of F. Now, however, 
the set @ < so is a set of analyticity. It may be shown that the set # lies in the 
basic component of the set .o/, 


Oh 
U.K Et = wai = 1,...,K5Upa, = Vera =o ‘ 


and is defined in that component by the equation 


Oh 


oo sa. 
OV p41 OVp42 


That is, the set Re # is the family F,". This family is a manifold if the determinant 
(3.31) does not vanish. In fact the family F is just that introduced by Moser 
[Moser, 1967, theorem 6]. Moreover, there are no other analytic families passing 
through the initial torus 7. 

Now let us take up the case of “general position” with | = 2. 


1 =2. We shall limit our attention to the simplest case k = m =n = 0, 1.e., to 
a Hamiltonian system with two degrees of freedom in the neighborhood of a 
stationary point. It is assumed that 2 = /,//, is rational. Many studies have been 
made of the family of periodic solutions in this case [see, for example, Henrard, 
1973, concluding remarks]. We shall briefly summarize the results. If A and 47? 
are not integers then the normal form has one-parameter Lyapunov families 
(see example 4.2): 
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Fig. 5 


Fig. 6 


FH =(Viv,=04=03, £7? {Vi0, =v, =0}. 


Moreover, there are two one-parameter families of periodic solutions F~ and 
F*. There are two cases (see (3.34)) depending on the signs of the coefficients of 
the normal form: 

1) The families F~ and F* are pure imaginary, 

2) The families F~ and F~* have real parts. 
In figures 5 and 6 the families #', Y?, F , and F* are represented in the f,, 
pf, plane for the respective cases 1) and 2) above. The heavy lines in these figures 
correspond to the real parts. If 47! or A are integers different from 1, 2, then the 
families Y', ?, ¥~, and F* remain; however, the family #! (or #7), is nota 
manifold, but has an algebraic singularity at the origin together with the families 
F~ and F*. According to theorem 2.4 


Rew =Re¥'UReY*UReF VUReF* , 
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Fig. 8 


and by theorem 2.2 all of these formal families are analytic since condition « is 
trivially satisfied [Meyer, et al, 1970, 1971; Henrard, 1973]. 

If there is a single small parameter ¢ in the system (e., ifn = 1), then for each 
fixed small e # 0 the families Y' and Y? remain where they were, but the families 
F~ and ¥* change approximately as shown in figures 7 and 8 for the respective 
cases 1) and 2) (for « = 0). Now in the case 1), new real branches of the families 
¥F~ and F* appear. These are called “bridges” and connect the resonant periodic 
solutions of the families ¥' and #?. 

In the case of arbitrary k we note that the number of parameters in the set 
* isn + 1, while the number in the set isn + 1 — k. 
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4.1. The Normal Form on an Integral Manifold 


Let 0 < /' <1 and suppose that in system (1.7), 
Av #ICP,Q> + qi A, tot qpay, fol +t... (4.1) 


for all Pe Z* and non-negative integers q,,..., q,. According to property 2 of 
section 1.4 the normal form (1.12) has a formal integral manifold 


L = LUAyy dy] = {U,V EV" =0} . (4.2) 


Here and below every vector V = (v,,...,0,) of dimension / will be divided into 
two subvectors V’ = (v,,...,0,) and V” = {v;4,,...,v,} of dimensions /’ and 
| — I’, respectively. 
First assume that m = 0. On the manifold (4.2) the normal form (1.12) becomes 
the system 
U = B(U,V’,0, E) = B(U,V',E) , 
: - (4.3) 
V' = ¥'(U,V',0,E) = #(U,V'E) , 


which is again a normal form. Let (1.11) be a normalizing transformation of (1.7) 
into (1.12). Instead of (1.11) let us consider the transformation 


X=U+5(U,V',0,AE)SU+eE, 
(4.4) 
Y=V+H(U,V0E)=V+4H. 
Here the variables U and V differ from the variables U and V of §1. Let the 
transformation (4.4) carry system (1.7) to the system 
U = ©*(U,V,E) , 
. (4.5) 
V=¥*(U,V.E) . 


Since transformation (4.4) and the normalizing transformation (1.11) coincide on 


the manifold (4.2), then for V” = 0 we have 


O*=h, Pea werneg. 


Consequently, the manifold V” = 0 is invariant for the system (4.5), and on that 
manifold system (4.5) induces the normal form (4.3). That is, in the expansions 


O* = Piy,V2ESexpi(P, UD , 


gt = Wk /0; = 3 dibosVOEF expicP,U) , a 

the coefficients Ppgs and gipgs with j < I’ vanish if 
Q”=0, iKP,Q> + (0A) #0, (4.7) 
while gjpgs = 0 for j > I ifq; = —1. A formal system (4.5) whose expansion (4.6) 


has property (4.7) is said to be normalized on the manifold (4.2). For example, the 
normal form (1.12) is normalized on all of its formal integral manifolds. 


Theorem 4.1. Suppose that in system (1.7) m = O and the vectors Q and A satisfy 
inequalities (4.1). Then under restriction 3 there is a formal change of coordinates 
(4.4) which carries system (1.17) to system (4.5), normalized on the formal integral 
manifold (4.2.). 


If m # 0, then in §1 system (1.7) was reduced to the seminormal form (1.17) 
by means of the transformation (1.16). We denote by = and #, the functions 2 
and 9; for V” = O(j = 1,...,/ + m) and we carry out the transformation 


X =U + 5(U,V',W,E) , 
y, =0,+9(U,V'’,WE) , i=1,...,0, (4.8) 
Vij = Wet (U,V, WE) : J=l,...m. 


The coordinates U, V, W differ from the analogous coordinates of § 1. As a result 
of the transformation (4.8), system (1.7) is carried into the system 


U = O*(U,V,E) + B*(U,V,W,E) , 

V = ¥*(U,V,E) + P*(U,V,W,E) , 
W_ = X*(U,V,W_,E) + X*(U,V,W,E) , 
W, = X3(U,V,W,,E) + X#(U,V,W,E) , 


where the right hand sides are in the ring A,[[V,W,E]]. For V" = 0 all the 
starred functions in system (4.9) coincide with the corresponding unstarred 
functions in the seminormal form (1.17). We say that the formal system (4.9) is 
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seminormalized on the manifold (4.2) if 

1) The series expansions (4.6) for 6* and ¥* have property (4.7); 

2) For V” = 0 the series &*, ¥*, ¥*, and X* have properties 2 and 3 in the 
definition of a seminormal form (adding asterisks to the terms in that definition 
as appropriate). 

Therefore, for V” = O system (4.9), seminormalized on manifold (4.2), also has 
the same properties as the seminormal form (1.17). We note three of them: 

1. The manifold 4, = {U,V,W,E: V” = 0} is not, generally speaking, an 
integral manifold. 

2. The manifolds 


L_ = {U,V,W,E:V" =0,W, =0}, 
L, ={U,V,W,E:V" =0,W_ = 0} 


are integral manifolds. On each manifold system (4.9) induces a “triangular” 
system of type (1.21) with variables V’ instead of V. The manifolds Y= Yy 0 W; 
are also invariant. 

3. GY = $Y AF, isan integral manifold on which system (4.9) induces the 
normal form (4.3). 


Theorem 4.2. If, in system (1.7), Q and A’ satisfy inequalities (4.1), then under 
restriction 3 there is a formal change of coordinates (4.8) which reduces system 
(1.7) to system (4.9), seminormalized on the formal integral manifold (4.2.). 


4.2. Questions of Convergence 


Let us first consider the classical formulation of the question of the con- 
vergence of transformation (4.4) or (4.8): 


Problem 4.1. Under what conditions on the normal form (4.3) (or, more gener- 
ally, on (1.12)) is the manifold Y (or LY or Z,) analytic? 


Condition &'. The numbersiaw,,...,i@,,,,..., Ap are pairwise commensurable. 
0 is considered to be commensurable with every number, including zero. 


This condition guarantees that there are no small divisors in transformation 
(4.8). 


Condition A’. There is a series a = a(U, V’, E) such that in the normal form (4.3) 


B=Qa, P=dvoa, jH,..l. 


Jj 


Theorem 4.3. In the setting of theorem 4.2 suppose that Q and A’ satisfy 
condition e' and the normal form (4.3) satisfies condition A’. Then transformation 


(4.8) converges in some neighborhood of the initial torus 7 . Hence, the manifolds 
YL, L$, and L, are analytic. 


The advantage of this theorem over theorem 2.1 is in the weakened conditions 
on the eigenvalues and on the normal form. On the other hand, analytic trans- 
formations are obtained only in certain subspaces. Note that thanks to condition 
(1.5) we have that k = 0 or | and restriction 3 is automatically satisfied. For 
k =m=0 theorem 4.3 is a consequence of results of Bruno [Bruno, 1971, 
theorem 3 in §10; or 1974a, theorem 1, which contains all earlier cases]. That 
condition A’ is necessary for the analyticity of the manifold & was shown by 
Bruno [Bruno, 1974a, theorem 2; also 1971, theorem ITI]. 


Example 4.1. Let k = 1 and suppose that 4,,..., 4, are all different from 0. 
Then in the subspace 


£= $[(0] ={U,VWE: V =0,W =0} 
condition A’ is 
QO (E) = aw, : 


and is always satisfied. According to theorem 4.3 the periodic solution 7 may 
be continued analytically in the parameters E£ if all of its eigenvalues are different 
from zero. This is Poincare’s theorem [Poincaré, 1892, v. 1, Ch. 3]. 


Example 4.2. Suppose that the original system is the Hamiltonian system 
(3.1), k = 0, the pure imaginary numbers 4, = — 4, # 0, and the numbers A,//,, 
..., 4,/A, are not integers. Then the subspace Y = #[/,,A,] is invariant in the 
normal form (3.6). The normal form is also Hamiltonian on #: 


db, = —-> Te (4.10) 


where 
— = 991 9 
h = hly=..=o-0 = > Aga P UP » 


Here the summation is over those integers q,, g2 > 0 which satisfy the equation 


4,4; + 4242 = 0. Since A, = —A, we have that q, = q,. Thus, h is a series in 
non-negative integral powers of p = v,v,. The normal form (4.10) has the form 
, ah oh 
v= "TD 5 v2 = 7 as ; 


ah 

Condition A’ is satisfied with a = Tab According to theorem 4.3 the manifold 
19p 

¥ is analytic and contains an (n + 1)-parameter family of periodic solutions 


adjoining the initial stationary point 7. This is the theorem of Lyapunov 
[Lyapunoy, 1892, Ch. II, 33-41]. See also some remarks of Bruno [Bruno, 1971, 
§ 10, example 3]. 
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Problem 4.2. On which formal set M' does the transformation (4.8) converge? 


Let us define the set 


dy = {U,V,W,E: V" =0,6 = Qa,h, = 4,v,a,j = 1,...,1} 


where a is a free parameter. Evidently we have that oy = Ly rn Sy. 


Theorem 4.4. Suppose that Q and A’ satisfy condition in the setting of theorem 
4,2. Then the transformation (4.8) converges near the initial torus 7 in the set Sy, 
which is itself an analytic set. 


Let us suppose that condition @’ is satisfied for the manifold # and that 
theorem 4.4 is applicable. If condition « is satisfied, then theorem 4.4 is a 
particular case of theorem 2.2. If condition is not satisfied, but condition f is 
satisfied, then theorem 2.3 gives on %y only the analyticity of the set By A Gy, 
which is smaller than the set .j,. Finally, it is possible that condition f is not 
satisfied and condition @’ is. In the last two cases theorem 4.4 gives new results 
in comparison with theorems 2.2 and 2.3. 


4,3. Periodic Solutions 


In subsection 2.3.2 there was introduced for each coordinate subspace # or 
£ an index of irrationality r’ = r’(#), the maximal number of elements of the 
set {iw,,...,i@,34,,...,4,} which are linearly independent over the rational field. 
Thanks to (1.5), we have that r‘ > k. Condition «’ is satisfied only for those 
submanifolds ¥ for which the index of irrationality is 0 or 1. 

First let us consider the case r’ = 0, which occurs only for k = 0 when the 
torus 7 is a stationary point. Let the manifold 4 = #([0]; that is, 


An= = Ap=0, A, #0, jol. 


On the manifold Ay the set .j is defined by the system of equations ¥’ = 0, 
ie., the set of’ = oA OF is a formal family of stationary points. According to 
theorem 4.4, it is analytic. 

Now let us consider the manifold ¥ for which r’'(#) = 1. According to 
theorem 2.4, the set Re »#’ consists of a family of periodic solutions adjoining the 
initial solution 7. By theorem 4.4 the set ./ is analytic, and we have the following 
consequence: 


Corollary 4.1. All formal families of periodic solutions adjoining the initial torus 
7 and lying outside the manifold £[0] for k = 0 are analytic. 


It was shown in example 2.3 that for k = 0 there may be a formal non-analytic 
family of periodic solutions in the subspace [0]. In an earlier paper [Bruno, 


1970] it was hypothesized that all formal families of periodic solutions adjoining 
J are analytic. Now it is clear that this hypothesis is true only if the formal 
families in [0] for k =O are excluded. In another paper [Bruno, 1974a, 
theorem 3] the analyticity of certain families is asserted. These are the formal 
families of periodic solutions which are manifolds and for which the period can 
be expanded in a Taylor series in the variables V, E. This property is only 
possessed by families lying outside # [0] for k = 0. But for families lying in 7 [0] 
with k = 0, the period has a singularity of pole type at the stationary point; Le., 
in these families the period tends to infinity near the point 7. For example, the 
period of the solution (2.13) in example 2.3 is 


2m a 


4/h vt + 3) 
Problem 4.3. Find all families of periodic solutions adjoining 7 . 


In order to solve this problem it is at first necessary to find all formal integral 
subspaces Y with index of irrationality r'(#) = 1. Then in each of these sub- 
spaces it is necessary to find the set Rew’. If k = 0 and some 4; = 0, then there 
is some ambiguity about the formal families of periodic solutions ¥[0]. In all 
other cases the indicated solution of problem 4.3 is complete. 

It is also possible to consider the complex version of problem 4.3. Let 
Z = Z(Z°, A®, t) be the general complex solution of system (1.1) which has value 
Z° fort = Oand 4 = A°. This solution is periodic (or contains a “complex cycle”), 
if for some complex number ¢° # 0 the equality Z(Z°, 4°, t°) = Z° is satisfied. 
The number ¢° is the period of this solution. It is easy to see that for every 
subspace # with index of irrationality r'(#) = 1 the set wm consists of 
complex periodic solutions (cf. theorem 2.4). 


Example 4.3. Let k = 0, / > 2, n = 1, m arbitrary; m,/A, + m,A, = 0, where 
m, and m, are relatively prime positive integers and 4; 4 q,4; + q242,j = 3,..., 
I, for all integers g,, q2 > 0. In the integral subspace 


FL = L(A, Az] = (Vie: 03 =" =v, = 9} 


the normal form (4.3) has the form (2.15’) 


0 


G2 Gnle(vt vz2)* = vjg; , j=1,2. 


It was shown in example 2.4 that the set . for this system consists of three 
components »', of”, and 0°. If condition (2.16) holds, the set .7? is a manifold 
of the form ¢ = é(vf''v2). The quantity 6 = vf" vz? is constant on each solution 
in the set 2/3. According to (2.6) the solutions in ./> are complex periodic “cycles” 
of period 
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2nim, 
g2(6,é) © 


By theorem 4.4 the component .x? is an analytic family of cycles. If a new time 
t = tt is introduced as in example 2.4, where t is some complex number, then we 
obtain new eigenvalues A; = 4,/t. On the complex plane the points 4, and A, are 
found on some line passing through the origin. The analyticity of the set 7? is 
preserved and we obtain theorem | of Pyartli [Pyartli, 1972]. If the original 
system is real and the eigenvalues 4, and A, are pure imaginary, then the periodic 
solutions of the family .o7? are real. 


§5. Integral Manifolds with Small Divisors 


Now let us consider the formal integral manifold (4.2) for which the condition 
x’ is not satisfied and l’ < |. Thus, in the calculation of this manifold small divisors 
appear, and the normal form (4.3) on the manifold (4.2) is no longer sufficient 
to answer the questions raised in problems 4.1 and 4.2. 


5.1. Formal Analysis 


First let m = 0. In the normal form (1.12) we isolate the terms of ¥” which 
are linear in V”: 


U 
w= Yo cyte, isl +1,...,1, 


jo 41 


where c, depend only on U, V, E, and we form the square matrix C” = (c)i-415 
where i,j = [' + 1,..., 1 The system 


U = G(U,V’,E) , 
Vi= ¥(U,V',E) , (5.1) 


V" =C"(U,V',E)V" 


is called the complete normal form of the integral manifold (4.2) [Bruno, 1971, 
§ 10]. The right hand sides of system (5.1) are made up of those terms from the 
right hand sides of system (1.12) for which ||Q’|| < 0 (in the notation of (1.13) and 
(1.14)), Le. either Q” = —E, or Q” = E; — E,, i,j =1' + 1, ..., | Now we shall 
apply to system (1.7) the transformation 


X=U+E(U,VE), 
Y=VW+4A(U,V’E), (5.2) 
y"=vy" + A’*(U, Vv’, E) . 


The coordinates U, V here are different from the coordinates U, V in §1, but 
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the transformation (5.2) is connected with the normalizing transformation (1.11) 
by the equalities 


a 
= 


=, =H’ for v"=0; 
i.e, 2 and A’ are the terms of Sand H’ which are independent of V”, while A’* is 
the part of the series H” which is constant or linear in V”. Such a transformation 


reduces system (1.7) to the form 


U = ®**(U,V,E) , 
. (5.3) 
V = Y**(U,V,E) , 


where for V” = 0 there is obtained the complete normal form (5.1) on the 
manifold #: 


= ‘ = OW Re" 
Prt =@, b Aiaieiae— liale = C" . 5.4 
ov" ( ) 
If we set 
pt* = Y OIB.VPE expicP, UD , 
(5.5) 
gt* = WF*/0, = ¥ giposVOES expi<P,U>, = f=ty...l, 
then 
O35,=0 and geos=0, if 
(5.6) 
WQ"| <0, = i<P,2>4+ <O,A> #0. 
That is, among the coefficients 33, and g75s5 with ||Q” || < 0 only the resonant 


terms for which equality (1.15) is satisfied are different from zero. A system (5.3) 
with this property is said to completely normalized on the manifold (4.2). 


Restriction 3’: Let y(Q) > 0 for all vectors Q € N', with ||Q"|| < 0 (the quantity 
y(Q) was introduced in section 1.4 before restriction 3). 


Evidently, restriction 3’ is weaker than restriction 3; for k =0 or 1 it is 
automatically satisfied. 


Theorem 5.1. Suppose that the vectors Q and A in system (1.7) with m =0 
satisfy inequality (4.1). Then under restriction 3' there is a formal change of 
coordinates (5.2) reducing system (1.7) to system (5.3), which is completely normalized 
on the formal integral manifold (4.2). 


If m # 0, then in § 1 system (1.7) was reduced to the seminormal form (1.17) 
by means of transformation (1.16). Let us denote by & and 4; the series = and n; 
for V" = 0, respectively, while 4* denotes the terms in the series y; which are 
constant or linear in V”. Now together with the transformation (1.16) we make 
the transformation 
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X=U + E(U.V’, WE) . 


w= +HUVWE), i=1,...,1, 


(5.7) 
y=utHRUVV' WE), galtl.al, 
Vien = Wet Ha (U,VWE) , k=1],...,m. 
This transformation reduces (1.7) to the form 
U = ®**(U,V,E) + B**(U,V,W,E) , 
V = ¥**(U,V,E) + **(U,V,W,E) , (5.8) 


W = X**(U,V,W,E) + X**(U,V,W,E) , 


where the right hand terms lie in the ring A,[LV, W, E}]. A system of form (5.8) 
is said to be completely seminormalized on the manifold (4.2) if 

1) it is seminormalized on the manifold (4.2); 

2) the series @** and ¥** have properties (5.4) and (5.6). System (5.8), since 
it is completely seminormalized on the manifold (4.2), has all the properties of 
the simple seminormalized system (4.9). 


Theorem 5.2. Suppose that Q and A in system (1.7) satisfy inequality (4.1). Then 
under restriction 3' there is a formal change of coordinates (5.7) reducing system 
(1.7) to system (5.8), which is completely seminormalized on the formal integral 
manifold (4.2). 


Note that theorems 5.1 and 5.2 are true under restriction 3’ which is weaker 
than restriction 3 used in theorems 1.1 and 1.2. Therefore there is the situation 
when system (1.7) does not reduce to the seminormal form (1.17) since restriction 
3 does not hold, but it does reduce to system (5.8), which is completely semi- 
normalized on the manifold (4.2) if restriction 3’ is satisfied. 


5.2. Questions of Convergence 


First let us answer the question posed in problem 4.1. Let x = min|i¢P,2> + 
<Q, A>|, where Pe Z*, QeEN’, iXP,Q> + (Q,A> #0, [PI + IQ <2, j= 1, 
2.04 (O'S 0 bere = 572 ing,. 


Condition f’: B’ > —oo. 


The condition ||Q”|| <0 distinguishes the number «; from the number g,. 
That is, in calculating «; the minimum is taken only over those Q for which Q” 
is 0, —E;, or E; — E; (i,j > 1). Therefore, «; < «; and it may be that condition 
B’ holds if condition B does not. Note that condition f’ is a condition on all 
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numbers @,,..., @,, 4;,---, 4,- If condition f’ is satisfied, then restriction 3’ is 
also satisfied. 


Condition B. In the complete normal form (5.1) condition A’ is satisfied, and 


the matrix 
B’ =C" —L"a (5.9) 


is nilpotent, where L" = {Ay 4,,...,4,} is a diagonal matrix. 


Theorem 5.3. Suppose that in the setting of theorem 5.2 condition f' is satisfied 
and the complete normal form (5.1) satisfies condition B. Then the transformation 
(5.7) is analytic in some neighborhood of the initial torus 7. 


This theorem is similar to a theorem of Bruno [Bruno, 1971, theorem 4 in 
§ 10] which was formulated for the situation covered by theorem 5.1 for k = 0. 
The necessity of condition A’ and the nilpotency of the matrix B” was shown 
in the reference just cited and also in other papers [Bruno, 1974a, theorem 2; 
1974b]. 

Now let us turn to problem 4.2. Consider the set .o/’ on the manifold (4.2) 
the matrix 

ab a6 \ 


oU ov’ 
B=| _ 

Ow of 

au ay 


where L’ = {A,,...,4,} is a diagonal matrix; B’ and the matrix B” in (5.9) are 
defined on .%’. The set #’ is that subset of the set 7’ on which the matrix B’ is 
nilpotent. The set #” is that subset of Z’ on which the matrix (5.9) is nilpotent. 
Tt is clear that 


Lr ~8# 7S. 


If restriction 3 is satisfied, then we have the seminormal form (1.17) for which 
the set # is defined. According to subsection 2.4.2, B" = Bn FY. We set By = 
{U,V,W,E: U,V, Ec B’}. 


Theorem 5.4. Suppose that in the setting of theorem 5.2 condition f.' is satisfied. 
Then transformation (5.7) converges on the set By near the initial torus 7, and 
By is an analytic set. 


Since By > By then, under condition f, theorem 5.4 is a consequence of 
theorem 2.3. The advantage of theorem 5.4 lies solely in that it uses the weaker 
condition f’ on Q and A (instead of condition B of theorem 2.3). Note that 
theorem 5.3 is a particular case of theorem 5.4. In fact, if condition A’ is satisfied 
then according to example 2.1 the matrix B’ is nilpotent on the manifold Y; 
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therefore ./’ = #' = ¥. Thanks to condition B we have that Y = 4” and 
theorem 5.4 is applicable. 
5.3. Continuation of the Initial Torus 
Let !' be such that 
A; =i¢P®,Q) , PMOEZ , a Lee 


(5.10) 
A, #ICP,Q) , all Pe Z* , pol. 
For simplicity we assume that 4, =--- = 2, =0; this assumption does not 


restrict the generality of our results [Moser, 1967, introduction]. The normal 
form (1.12) has an integral manifold (4.2), which we denote by “[0] and call the 
null manifold. Now consider the set # A #[0]. The complete normal form (5.1) 
on the null manifold has the form 


U = HVE), 
Vi= PVE) , (5.11) 
V"=C"(U,V',E)V" . 

Let us make the additional assumption 


A, —4,#i<P,Q>, forall PeZ*, P#0, ijHlth..l. 
(5.12) 


Then the matrix C” does not depend on U and commutes with the diagonal 
matrix L” = {Ap4,,.--, Ar}: 


C’=C"V.E), (5.13) 
Cr s1'C": (5.14) 


This follows from the fact that the only non-zero coefficients c,;p9-s in the series 
expansion of an element c,; of C’, 


cy = >, Cypg's V'2 BE’ expi¢P, UY 
are those for which 
A, — A; =1<P,Q) + <Q'AD . (5.15) 


As long as A’ = 0 and (5.12) is satisfied then equalities (5.15) hold only for P = 0. 
Hence, (5.13) is satisfied and ¢, = 0 if 4,44, 4j=l' +1, ..., ve, (5.14) is 
satisfied. 

The set .v’ is defined by the system 


@(V',E) = Qa , (5.16) 


WV E)=0. (5.17) 
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According to theorem 2.4 the set Re «/’ consists of families of k-dimensional tori 
adjoining the torus 7. The set @” is defined by conditions (5.16), (5.17) and 
the requirement that the matrices 


ov’ 
Bi=ay> BY =C’— L'a (5.18) 


be nilpotent. The number of variables V’, £, a is [’ + n+ 1, while the number 
of conditions is k+ +l +(—-l')=k+1' +41. Hence, the number of free 
parameters in the set Bis’ +n+1—(kK++)=n+1—(k4+ J). That is, 
in an arbitrary system sufficiently many of the original parameters are required 
so that the set 2” may be distinguished from the initial torus 7. 

As seen from section 3.2, '=k and ¥ =0 for a Hamiltonian system in 
general position. Therefore the set @” is defined only by system (5.16) and by 
the nilpotency of the matrix B”. The matrix B” of order 2/ = 1! — kis Hamiltonian, 
and its characteristic polynomial contains only even powers (i.c., only / nontrivial 
coefficients). Hence, the nilpotency of the matrix B” imposes only / conditions. 
Together with system (5.16) the set #” is defined by k + [equations ink + n + 1 
parameters. That is the number of free parameters is 


ktn+1—(k+)D=n4+1-T. 


If | = 0, then the set @” always has at least one parameter (see examples 3.1 
and 3.2). If? = 1 and n = 1, then the set 9” has a single parameter (see section 3.2 
with /=1 and n= 1). In particular if the determinant of the matrix (3.28) 
(or of (3.31), or (3.32)) is different from zero, then the set Re ¥” also has one 
parameter. According to theorem 5.4 the initial torus 7 may be continued 
analytically in the parameter ¢ if conditions (5.10), (5.12), B’, and (3.28) hold. 
Moser has considered the particular case of this assertion for m = 0 [Moser, 1967, 
theorem 6]. Inequalities (5.10) and (5.12) hold under condition (1.7), condition 
(B’) holds under condition (1.7’), while condition (3.28) is Moser’s condition (6.16). 
In comparison with the situation in section 3.B there are now weaker conditions 
on 922 and A: conditions (3.15) and f are replaced by (5.10) and (5.12), and p’, 
respectively. 

Moser suggested the analytic continuation of the initial torus 7 with respect 
to the small parameter ¢, introducing into the system a certain number of new 
parameters for this purpose [Moser, 1967; also, 1966, 1969]. To do this he looked 
for those continuations with all frequencies and eigenvalues exactly the same as 
those of the initial torus 7. More recently it has been observed [Bibikov, 1973a] 
that it is not necessary to preserve the complex eigenvalues, only to preserve the 
frequencies and the eigenvalues lying on the imaginary axis. This makes it 
possible to decrease the number of additional parameters introduced into the 
system. 

Now let us consider the Moser-Bibikov construction from our point of view. 
We shall write the original system (1.7) in the form 
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X =Q 4+ &F!(X,Y,Z,¢) , 

Y = LY + eF?(X, Y,Z,#) , (5.19) 
Z=A,Z 4+ eFP(X, Y,Z,6) , 


where diagonal matrix L= {/,,...,4,}, Re; = 0, the eigenvalues of the matrix 
A, aQe fly, -.-5 Um, Rew; # 0. Instead of system (5.19) we shall consider system 


X=Q4+K+eF", 
Y=LY+N+MY+cF), (5.20) 
Z=A,Z4+eF°, 


where K = (k,,...,«,), N = (v,,...,v,) are vectors and M is a square matrix of 
order I, where 
LN=0, LM = ML (5.21) 


On the manifold #[0] system (5.20) has the complete normal form (5.1) whose 
right hand sides are 


@=Q4+ K+ O(e) + O(|V'|) + O(\E|?) 
w= N+ M'V’ + O(c) + O(|V'|?) + O(/EI*) , (5.22) 
C” = L" + M” + O(e) + O([V'|) + O(/EI’) , 


where M' and M” are square matrices of orders I’ and | — 1’, respectively. 
According to (5.21), the matrix M is block diagonal, M = {M’,M"}, where M' 
is arbitrary and M” commutes with L”: 


L"'M’ =M"'L’ . (5.23) 


E denotes the collection of all the parameters. Now we shall show that in the set 
RB" we always have a family of the form 


Vi=0, K=Ke), N=N(), M=M(e), M"=M"e); 
K=0, N=0, M=0, M"=0 for e=0. (5.24) 
By (5.22), equations (5.16) and 5.17 (for a = 1, V’ = 0) have the form 
K + O(e) + O(JE|?) = 0 
(5.25) 
N' + O(e) + O(/E|?) =0 


The matrices (5.18) for a = | are 
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B, = M’ + O(e) + O(JE|?) , 
BY = M’ + O(e) + O(\EI?) , 


and according to (5.14), L”B” = B"L”. According to the Implicit Function 
Theorem the system of equations (5.25), B, = 0, B” = 0 has a unique solution 
of the form (5.24), and condition (5.23) is satisfied. Evidently, this solution lies 
in the set 4". Thus, theorem 1 in Bibikov’s paper [Bibikov, 1973a] follows 
from theorem 5.4. In the same way one of Moser’s results also follows from 
Theorem 5.4 [Moser, 1967, theorem 1]. In Bibikov’s work the quantities Q, L, 
A,, K, Nand M are denoted by w, Q, P, 4, u, and M, respectively. 


Author’s Comments (1986) 


1. Generally speaking, each formal change of coordinates is not unique (i.e. 
the normalizing transformation, the seminormalizing transformation etc.). So 
statements of the theorems on analyticity (convergence) of such transformations 
should be understood as follows: there exists a corresponding transformation, 
which is analytic (converges)... . 

2. The sets %, and @, contain invariant subsets o&,, ¢%_ and B,, B_ 
respectively (see property 6 of section 1.5). The analyticity of the corresponding 
invariant sets o&,, %_ or B,, B_ is a consequence of the analyticity of the sets 
My or By. Similarly for sets HyaL., GpoaL_ and By nL,, Byrn L_, 
where # is a formal invariant subspace (see §§ 4 and 5). The analyticity of those 
formal invariant sets is the main result of the work. The analyticity of the other 
parts of the sets o%, and @y, is not interesting because they are not invariant. 
In my subsequent works, I changed the approach to the problem and looked 
only for analytic invariant sets [see § 3, of Ch. III of the book “Local Method...,” 
Part I in this translation]. 
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